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should  be  printed  on  8- 1  /  2  x  1 1  inch  papers  with  13/16  side  margins,  1-1/16  inch  top  margin,  and  1  inch  on  the 
bottom.  On  the  first  page,  place  title  1-1/2  inches  from  top  with  author  and  affiliation  beneath  the  title  Single 
spaced  type  using  10  or  12  point  front  size,  entire  text  should  be  justified  (flush  left  and  flush  nght).  No  typed 
page  numbers,  but  number  your  pages  lightly  in  pencil  on  the  back  of  each  page. 

SHORT  COURSES/HANDS-ON-WORKSHOP 

Short  courses  and  Hands-on-Workshops  will  be  offered  in  conjunction  with  the  Symposium  covering  numerical 
techniques,  computational  methods,  surveys  of  EM  analysis  and  code  usage  instruction  It  is  anticipated  that 
short  courses  will  be  conducted  principally  on  Monday  and  Friday.  Workshops  will  be  held  Monday,  Friday,  and 
Saturday  Short  Course  Fees:  Half-day  course  $100;  Full-day  course  $160.  Workshop  Fees:  Half-day 
workshop  $120;  Full-day  workshop  $235.  Fees  are  subject  to  change.  Quoted  fees  apply  only  if  booked 
before  1  March  2000. 

EXHIBITS 

Vendor  booths  and  demonstrations  will  feature  commercial  products,  computer  hardware  and  software  demon¬ 
strations,  and  small  company  capabilities. 
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PREFACE 


On  behalf  of  the  ACES  Technical  Program  Committee,  I  welcome  you  to  “The  15th  Annual 
Review  of  Progress  in  Applied  Computational  Electromagnetics.”  The  Symposium  has  been 
extended  to  six  days  this  year  beginning  Monday,  March  15  through  Saturday,  March  20,  1999, 
and  takes  place  at  the  Naval  Postgraduate  School  (NPS)  in  Monterey,  California.  The 
Symposium  is  sponsored  by  the  Applied  Computational  Electromagnetics  Society,  NPS,  and  the 
University  of  Nevada  Reno. 

Monday,  March  15,  Friday,  March  19,  and  Saturday,  March  20  are  devoted  to  short  courses  and 
workshops.  Technical  sessions  take  place  Tuesday,  March  16  through  Thursday,  March  18,  with 
an  Interactive  Session  and  Vendor  Exhibits  scheduled  for  Tuesday  afternoon.  This  year’s 
Symposium  features  twelve  technical  sessions  and  three  plenary  speakers.  Several  students  are 
also  participating  in  the  Student  Paper  Contest. 

Many  volunteers  cooperated  to  make  this  symposium  possible.  Symposium  Co-Chairs,  Professor 
Indira  Chatteijee  and  Professor  James  Henson  helped  review  papers  and  organize  sessions. 
Professor  Michael  Jensen  organized  short  courses  and  Professor  Keith  Whites  organized  vendor 
exhibits.  I  am  indebted  to  these  professors  for  their  efforts.  Professor  Richard  Adler  and  his  wife 
Pat  spend  many  days  making  all  the  arrangements,  getting  the  proceedings  published,  organizing 
registration,  and  training  me.  I'd  like  to  thank  Dr.  Bob  Bevensee,  the  ACES  Conference 
Committee  Chairman,  for  his  support,  and  to  Professor  Atef  Elsherbeni  for  collecting  the 
abstracts  for  posting  on  the  web  site.  Earlene  Absher,  Janel  Fontana,  and  Erwin  Coleman 
provided  significant  support  here  at  the  University  of  Nevada.  Finally,  a  big  thanks  to  the  session 
organizers  for  putting  together  some  excellent  sessions  and  our  plenary  speakers. 


Enjoy  the  15th  Annual  Review  and  your  stay  in  Monterey. 


Randy  Haupt 
Technical  Program  Chair 
1999  ACES  Conference 


ACES  PRESIDENT  S  STATEMENT 


The  ACES  Annual  Review  of  Progress  in  Applied  Computational  Electromagnetics  has  a  tradition  of 
excellence  that  guarantees  a  pleasant  and  educational  experience.  On  the  other  hand,  there  is  no 
guarantee  that  California  will  remain  attached  to  the  rest  of  the  continent.  So  long  as  it  does,  however, 
good  ACESians  everywhere  can  look  forward  to  a  Monterey  pilgrimage  during  the  third  week  of  March. 
The  Naval  Postgraduate  School  has  been  a  gracious  host  and  afforded  this  special  and  important  event, 
and  we  again  thank  Dick  and  Pat  Adler  for  their  many  hours  devoted  to  local  arrangements. 

Randy  Haupt  and  his  capable  conference  team  have  put  together  an  excellent  technical  program.  You 
should  know  that  ACES  has  intentionally  reduced  the  number  of  accepted  papers,  both  last  year  and 
continuing  again  this  year,  so  that  the  annoyance  of  multiple  parallel  sessions  might  be  substantially 
alleviated.  Many  registrants  still  remember,  with  some  fondness,  the  earlier  years  with  ''l"  parallel 
session.  The  opportunity  for  all  to  gather  and  hear  the  technical  program  in  a  single  auditorium  was 
a  distinctive  feature  of  the  ACES  conference  for  several  years.  Parallel  sessions  unavoidably  fragment 
the  audience  and  disrupt  the  personal  encounters  and  exchanges  that  were  hallmarks  of  our  early 
conferences.  As  a  practical  matter,  we  probably  cannot  go  back  to  the  "single-session"  format,  but  ACES 
is  committed  to  holding  the  line  at  a  maximum  of  three  parallel  sessions  in  the  future,  and  we  further 
aspire  to  generally  achieve  a  format  with  only  two  parallel  sessions.  This  is  slightly  short  of  ideal,  but 
a  good  compromise  and  a  heckuva  lot  better  than  you  find  at  most  conferences  elsewhere. 

Anyway,  we  recognize  that  conferences  are  more  than  just  paper  sessions,  short  courses,  and  award 
banquets,  and  ACES  is  actively  striving  to  enhance  your  opportunities  to  spend  some  quality  time  with 
colleagues  and  friends,  both  old  and  new.  If  you  see  a  way  that  we  can  further  improve  the  conference, 
please  do  let  us  hear  from  you! 

In  addition  to  enjoying  the  ACES  Annual  Review,  I  hope  you  will  similarly  enjoy  the  availability  of  ACES 
Publications  the  next  time  you  are  ready  to  publish!  The  ACES  Journal  and  ACES  Newsletter  are 
influential  information  outlets  for  the  CEM  community  worldwide,  and  you  are  invited  to  make  them 
your  publishing  vehicle  of  choice  in  the  future. 

Best  wishes  for  a  full  and  gratifying  1 5th  Annual  Review! 

Perry  Wheless 
ACES  President 
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ACES  1999  SHORT  COURSES/WORKSHOPS 


MONDAY  1 5  MARCH  1999 

0830-1630  SHORT  COURSES  (FULL-DAY) 

"Model-Order  Reduction  as  a  Means  of  increasing  the  Efficiency  of  Computational 
Electromagnetics" 

Andreas  Cangellaris,  University  of  Illinois  at  Urbana-Champaign,  Edmund  K.  Miller,  Santa  Fe,  NM. 
0830-1630  SHORT  COURSES  (FULL-DAY) 

"Finite-Difference  Time  Domain  Method",  Omar  Ramahi,  Compaq  Computer  Corp.,  Maynard,  MA. 

0830-1630  SHORT  COURSE#3  (FULL-DAY) 

"Finite  Elements  for  Electromagnetics",  Scott  Savage,  Ansoft  Corporation,  Pittsburgh,  PA. 


PLEASE  NOTE  THAT  A  10%  DISCOUNT  IS  IN  EFFECT  FOR  ALL  WORKSHOPS  TAKEN  AFTER  ATTENDING  AN  INITIAL 
WORKSHOP  THIS  APPLIES  TO  THE  GROUP  OF  WORKSHOPS  FOR  MATLAB  AND  MATHCAD.  IT  ALSO  APPLIES  TO  THE 
NEC  WORKSHOPS.  NOTE  THAT  MATLAB/MATCAD  DISCOUNTS  DO  NOT  APPLY  TO  THE  NEC  WORKSHOPS,  AND  VICE 
VERSA. 


0830-1 1 30  HANDS-ON-WORKSHOP#4  (HALF-DAY) 

"Basic  MATHCAD"  (Introduction  for  Workshop  #9) 

Jovan  Lebaric  and  Robert  Vitale,  Naval  Postgraduate  School,  Monterey,  CA. 

1 330-1 630  HANDS-ON-WORKSHOPS5  (HALF-DAY) 

"Basic  MATLAB"  (Introduction  for  Workshop  #1 1 ) 

Jovan  Lebaric  and  Robert  Vitale,  Naval  Postgraduate  School,  Monterey,  CA. 

FRIDAY  19  MARCH  1999 

0830-1130  SHORT  COURSE  #6  (HALF-DAY) 

“Verification  and  Validation  of  Computational  Electromagnetics  Software 
Edmund  K.  Miller,  Santa  Fe,  NM 

1330-1630  SHORT  COURSE  #7  (HALF-DAY)  .  . 

“A  Survey  and  Comparison  of  Computational  Electromagnetics  Options 
Edmund  K.  Miller,  Santa  Fe,  NM 

0830-1630  SHORT  COURSE#8  (FULL-DAY) 

“Genetic  Algorithm  Optimization” 

Randy  L.  Haupt,  and  J.  Michael  Johnson,  University  of  Nevada,  Reno,  NV 

0830-1630  HANDS-ON-WORKSHOP  #9  (FULL-DAY) 

“Method  of  Moments  (MoM)  Using  MATHCAD” 

Jovan  Lebaric  and  Robert  Vitale,  Naval  Postgraduate  School,  Monterey,  CA. 

0830-1630  HANDS-ON-WORKSHOP  #10  (FULL-DAY) 

“Introduction  to  Using  NEC  under  Windows” 

J.K.  Breakall,  Penn  State  University,  G.J.  Burke,  LLNL,  R.W.  Adler,  Naval  Postgraduate 
School,  Monterey,  CA. 

SATURDAY  20  MARCH  1999 

0830-1 630  HANDS-ON-WORKSHOP  #1 1  (FULL-DAY) 

“Finite  Difference  (FD)  Using  MATLAB” 

Jovan  Lebaric  and  Robert  Vitale,  Naval  Postgraduate  School,  Monterey,  CA 

0830-1 630  HANDS-ON-WORKSHOP  #1 2  (FULL-DAY) 

“Advanced  Applications  using  NEC4.1  under  Windows” 

J.K.  Breakall,  Penn  State  University,  G.J.  Burke,  LLNL,  R.W.  Adler,  Naval  Postgraduate 
School,  Monterey,  CA. 
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AGENDA 

The  Fifteenth  Annual  Review  of  Progress  in  Applied  Computational  Electromagnetics 

NAVAL  POSTGRADUATE  SCHOOL 
15-20  March  1999 

Randy  Haupt,  Technical  Program  Chairman 
Indira  Chatterjee,  Symposium  Co-Chairman 
James  Henson,  Symposium  Co-Chairman 
Michael  Jensen,  Short  Course  Chairman 
Keith  Whites,  Vendor  Chairman 
Richard  W.  Adler,  Symposium  Administrator 

MONDAY  MORNING  15  MARCH  1999 

0700-0730  CONTINENTAL  BREAKFAST  Glasgow  Courtyard 

(For  short  course  and  hands-on-workshop  attendees  ohly ) 

0730-0820  SHORT  COURSE/HANDS-ON-WORKSHOP  REGISTRATION  Glasgow  103 

0830-1630  SHORT  COURSE  #1  (FULL-DAY) 

"Model-Order  Reduction  as  a  Means  of  Increasing  the  Efficiency  and  Effectiveness  of  Computational  Electromagnetics” 
Andreas  Cangellaris,  University  of  Illinois  at  Urbana-Champaign,  and  Edmund  Miller,  Santa  Fe,  NM. 

0830-1630  SHORT  COURSE  #2  (FULL-DAY) 

“Finite-Difference  Time-Domain  Method” 

Omar  Ramahi,  Compaq  Computer  Corporation,  Maynard,  MA 

0830-1630  SHORT  COURSE  #3  (FULL-DAY) 

"Finite  Elements  for  Electromagnetics”,  Scott  Savage,  Ansoft  Corporation,  Pittsburgh,  PA. 

0830-1130  HANDS-ON-WORKSHOP  #4  (HALF-DAY) 

"Basic  MATHCAD”  (Introduction  for  Course  9)  Jovan  Lebaric  and  Robert  Vitale,  Naval  Postgraduate  School,  Monterey, 
CA. 

0900-1200  CONFERENCE  REGISTRATION  Glasgow  103 

1130-1315  BOD  BUSINESS  LUNCHEON  Del  Monte  Room 

MONDAY  AFTERNOON 

1330-1630  HANDS-ON-WORKSHOP  #5  (HALF-DAY) 

"Basic  MATLAB”  (Introduction  for  Course  #11)  Jovan  Lebaric  and  Robert  Vitale,  Naval  Postgraduate  School,  Monterey, 
CA. 

1400-2000  CONFERENCE  REGISTRATION  Glasgow  103 

MONDAY  EVENING 

1900  PUBLICATIONS  DINNER 
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TUESDAY  MORNING  16  MARCH  1999 
0700-0745  CONTINENTAL  BREAKFAST 

0745  ACES  BUSINESS  MEETING  President  Perry  Wheless 

0800  WELCOME  Randy  Haupt 

0815  PLENARY  SPEAKER:  Reinaldo  Perez,  JPL 

“Computational  Electromagnetics  in  Spacecraft  Design” 

SESSION  1 :  FDTD  (Parallel  with  Session  2) 

Chair:  Atef  Elsherbeni 

0920  "A  Finite-Difference  Time-Domain  Algorithm  Based  on  the  Recursive  Convolution  Approach 
for  Propagation  of  Electomagnetic  Waves  in  Nonlinear  Dispersive  Media” 

0940  "Methods  for  Including  Lumped  Elements  in  FDTD  Simulations" 

1000  "A  Novel  Subgridding  Scheme  for  Refractive  Index  Adaptive  FDTD" 

1020  BREAK 

1 040  "Characterizing  Meander  Line  Antennas  for  RF  and  Wireless  Communications  Using  the  FDTD 
Techniques" 

1 1 00  'Impulse  Response  of  Microwave  Devices  by  FDTD  and  Moment  Expansion" 

1120  The  Cause  of  Oscillatory  Dispersion  Relation  of  Equivalent  Circuit  Parameters  for  Microstrip 
Component  Using  FDTD  Method" 

1140  "An  Implicit  Characteristic  Based  Finite  Difference  Method  for  Computational  Electromagnetics 


1200-1300  PUBLICATION  COMMITTEE  BUSINESS  &  LUNCHEON 

1200  LUNCH 

SESSION  2:  ANTENNAS  (Parallel  with  Session  1) 

Chair:  Nathan  Cohen  Co-Chair:  Jacob  Kim 

0920  "End  Fed  Antennas' 

0940  'Investigation  of  a  Multi-Resonant  Antenna  for  Wideband  Application" 

1000  "Computation  of  Clustered  Phased  Array  EM  Fields  Using  Particle  Accelerator  Codes 

1020  BREAK 


Glasgow  Courtyard 
Glasgow  102 
Glasgow  102 
Glasgow  102 


S. J.  Yakura, 

J.T.  MacGillivray. 

D.  Dietz 

Ian  Rumsey,  Jason  Mix, 
Melinda  Piket-May 

T. O.  Komer,  W.  Fichtner 


Chun-Wen  Paul  Huang 
Jiang-Bin  James  Chen 
Atef  2.  Elsherbeni 
Charles  E.  Smith 

Gaetano  Marrocco 
Fernando  Bardati 

Lee  L.  Liou 


John  H.  Beggs 
W.  Roger  Briley 

Del  Monte  Room 


R.P.  Haviland 

P.L.  Werner,  N.V.  Veremey, 
R.  Mittra  and  D.H.  Werner 

Ross  A.  Speciale 
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TUESDAY  MORNING  16  MARCH  1999 


SESSION  2:  ANTENNAS  (cont) 

1040  "Extended  Ray-Optical  Technique  for  Installed  Performance  of  Large  Airborne  Antenna  Systems” 

1 100  "Antenna  Imaging  and  Visualization” 

1120  "Are  Fractals  Naturally  Frequency  Invariant/Independent?" 

1 140  "The  Ultimate  Antenna  Training  Aid" 

1200  LUNCH 

TUESDAY  AFTERNOON 

INTERACTIVE  POSTER  SESSION 
1300-1530 

VENDOR  EXHIBITS 
1300-2000 

WINE  AND  CHEESE  BUFFET 
1500-1700 

SESSION  3  INTERACTIVE  POSTER  SESSION 

1300-1530 

"Extension  of  the  Method  of  Lines  for  Planar  3D  Structures" 

"Noise  issues  in  Submicron  Design" 

"Numerical  Calculations  for  the  Back  Scattering  and  Extinction  Cross  Sections  of  Plane  Waves 
Incident  Obliquely  on  an  Eccentric  Multi-layered  Cylinder" 

"Using  the  Partial  Element  Equivalent  Circuit  (PEEC)  Simulation  Technique  for  EMC  Applications" 

"Analytic  Validation  of  a  Low-Frequency  Magnetic  Induction  Code  with  Nonuniform  Sources" 

“A  Study  of  Finite  Element  and  Cylindrical  Perfectly  Matched  Layer  Approach” 

"Conditioning  of  Bi-orthogonal  Wavelet  Transforms  and  Iterative  Solvers  for  Electromagnetic 
Integral  Equations" 

“Exact  Implementation  of  Bayliss-Turkel  Boundary  Operators  in  2D  and  3D  Scalar  and  Vector 
Wave  Equations" 

"Formulae  for  Instantaneous  Power  and  Time-Average  Power  Delivered  to  Charge-Current  Distributions" 
“CEM  for  RCS  Measurement  Calibration:  Some  Parametric  Observations" 

“Analytical  Bistatic  K  Space  Images  Compared  to  Experimental  Swept  Frequency  ISAR  Images” 

“A  Hybrid  Periodic  Moment  Method  Formulation  for  Scattering  from  Large  2-D  Wire  Arrays' 

“Accuracy  and  Conditioning  of  the  Method  of  Moments  for  the  2D  EFIE" 

“hp-Adaptive  FE  Modeling  for  Maxwell's  Equations-  -  Evidence  of  Exponential  Convergence " 


Jacob  J.  Kim,  Kuang  Wu, 
Sidney  W.  Theis 
Oren  B.  Kesler 

John  Shaeffer 
Brett  Cooper 

Nathan  Cohen 

Alan  Nott 
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Kueichien  C.  Hill 

John  Shaeffer 
Kam  Horn 

Peter  J.  Collins 
J.  Paul  Skinner 
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L.  Demkowicz 
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TUESDAY  EVENING  16  MARCH  1999 
1900  BOD  DINNER 

WEDNESDAY  MORNING  17  MARCH  99 
0700-0800  CONTINENTAL  BREAKFAST 

0815  PLENARY  SPEAKER:  Magdy  Iskander,  University  of  Utah 

“Multimedia  -  and  Computer-Based  Engineering  Education” 

SESSION  4  :  FINITE  ELEMENTS  I  (Parallel  with  Session  5) 

Chair:  Leo  Kempel  Co-Chair:  Jianming  Jin 

0920  “Electromagnetic  Modeling  of  Microstrip  Structures  in  Multilayer  Media" 

0940  "Modeling  Wideband  Conformal  Antennas  Using  Finite  Element-Boundary  Integra!  Techniques” 
1 000  "The  Northrop-Grumman  Duct  RCS  Prediction  Code" 

1020  BREAK 

1 040  "Implementation  of  Various  Hybrid  Finite  Element-Boundary  Integral  Methods:  Bricks.  Prisms, 
and  Tets" 

1 1 00  "Fast  Hybrid  Finite  Element  Methods  and  Their  Applications  for  Conformal  Antennas" 

1120  "A  Hybrid  MoM/FEM  Method  for  Scattering  from  a  Complex  BOR  with  Appendages" 

1 140  "Frequency-Domain  Complementary  Operators  for  Finite  Elements  Simulation" 

SESSION  5:  MODEL-BASED  PARAMETER  ESTIMATION  (Parallel  with  Session  4) 

Chair:  Andreas  Cangellaris  Co-Chair:  Edmund  Miller 

0920  "Model-Based  Frequency  Extrapolation  of  Antenna  Radiation  Characteristics  on  Complex 
Platforms” 

0940  "Developing  Adaptive  Models  and  Estimating  the  Uncertainty  of  Presampled  Spectral  Data" 
1000  "Techniques  for  Model-Based  Parameter  Estimation  of  Antenna  Radiation  Patterns” 

1020  BREAK 

1040  "Full  Wave  Analysis  of  Generalized  Microstrip  Lines  Using  Model  Order  Reduction  Techniques 
1 1 00  "Frequency  and  Angular  Extrapolations  in  Hybrid  Finite  Element-Boundary  Integral  Systems 

1120  "Reduced-Order  Modeling  of  Three-Dimensional  Electromagnetic  Wave  Fields" 

1 1 40  "Rapid  Simulation  of  Electromagnetic  Resonant  Structures  Using  Model  Order  Reduction  and 

Unstructured  Finite  Difference" 

1200  LUNCH 
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WEDNESDAY  AFTERNOON  17  MARCH  1999 


SESSION  6:  OPTIMIZATION  (Parallel  with  Session  7) 

Chair:  John  Volakis  Co-Chair:  Zachi  Baharav 

1320  "Recent  Results  in  Electromagnetic  Optimization  of  Microwave  Components  Including  Microstrip 
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The  business  of  building  satellites  has  been  around  for  over  30  years.  Over  the  last 
several  years  considerable  improvements  have  been  made  in  the  design,  development  and 
manufacture  of  satellite  subsystems  in  order  to  accommodate  the  increasing  needs  of 
wireless  communications  companies  and  the  needs  of  purely  scientific  missions  from 
NASA  and  other  government  agencies.  We  discuss  in  this  work  some  of  the  advances 
done  in  spacecraft  which  are  built  for  scientific  missions;  specifically  small  spacecraft 
designed  for  the  exploration  of  outer  planets  and  other  celestial  bodies.  In  this  work  we 
discuss  the  role  played  by  computational  electromagnetic  in  assisting  in  some  of  the 
design  aspects. 
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A  Finite-Difference  Time-Domain  Algorithm  Based  on  the  Recursive  Convolution  Approach 
for  Propagation  of  Electromagnetic  Waves  in  Nonlinear  Dispersive  Media 

S.  J.  Yakura,  J.  T.  MacGillivray  and  D.  Dietz 
Air  Force  Research  Laboratory,  Directed  Energy  Directorate 
Kirtland  AFB,  NM  87117-5776 
Phone:  (505)  846-0995;  Fax:  (505)  846-0566 
E-mail:  yakuras@plk.af.mil 

Abstract 

We  present  for  the  first  time  a  successful  formulation  of  a  finite-difference  time-domain  algorithm  that  is  based  on 
the  recursive  convolution  approach  and  is  used  to  evaluate  the  propagation  of  electromagnetic  waves  innonlmear 
dispersive  media.  We  treat  in  particular  the  case  where  the  nonlinear  polarization  term  depends  only  on die  product 
of  the  square  of  the  electric  field  and  the  third-order  electric  susceptibility  function.  Ho™™  the  °PP™£  ls 
general  and  we  can  easily  extend  the  formulation  to  the  product  of  any  power  of  the  electric  field  with  the 
dectric  field  susceptibility  function.  We  find  that,  in  contrast  to  the  usual  formulation  for  linear  dispersive  materials 
which  uses  a  simple  linear  relationship  between  the  next-time-step  electric  field  and  the  previous-time-step  elei One 
field,  the  formulation  for  nonlinear  dispersive  materials  with  a  third-order  susceptibility  function  resu ihs  in  an 
algebraic  cubic  equation  which  relates  the  next-time-step  electric  field  to  the  previous-time-step  electric  field. 
Consequently,  the  algebraic  cubic  equation  must  be  solved  at  each  time  step  to  advance  the  electric  field. 

L  INTRODUCTION 

There  has  been  considerable  recent  interest  in  understanding  the  transient  behavior  of  an  ultrafast  laser  P^e  that 
interacts  with  a  nonlinear  dispersive  material.  In  the  last  several  years  many  expenmentahsts  have  made  use  of 
newly  developed  Kerr  lens  mode-locked  titanium-sapphire  lasers  to  perform  well-controlled  experiments  to  obfcu 
accurate  measurements  of  the  transient  behavior  of  ultrafast  laser  pulses  in  simple  molecular  liquids *  and i  solids 
which  are  known  to  exhibit  nonlinear  optical  behavior  [1].  To  better  understand  the  details  of  nonlinear  prwesses 
that  are  observed  in  the  experiments,  numerical  simulations  have  been  used  extensive  y  to  repr  “ 
nonlinear  effects.  Until  recently  most  computer  simulation  has  been  performed  by  solving  an  approxima 
MaxwelF  sequations,  known  as'  the  generalized  nonlinear  Schrodinger  (GNLS)  equation  [2]  to  get  mform^on 
about  the  time  evolution  of  the  envelope  of  the  propagating  oscillating  wave  packet  m  order  to  . obtain ^overall 
shape  of  the  propagating  optical  pulse.  Because  a  GNLS  equation-based  computer  simulation  does  not  provide  any 
information  abom  the  details  of  the  oscillating  waves  inside  the  envelope  of  the  optical  pulse,  there  is  a  renewed 
interest  in  solving  Maxwell’s  equations  directly  without  having  to  rely  on  any  approximations.  . 

Happily,  with  the  advent  of  present  day  computers  which  provide  very  fast  execution  times Sons 
of  computer  memory,  we  are  at  the  point  where  we  have  enough  computational  power  to  solve  Maxwell  s  equaOons 
directly  for  nonlinear  dispersive  materials.  Among  recently  investigated  JkTJd  on 

promise  in  achieving  this  goal  is  the  well-known  finite-difference  time-domain  (FDTD)  method  [31.  It  is  based  on 
E^lple  differencing  scheme  in  both  time  and  space  to  calculate  the  transient  behavior  «***£**»> 
(EM)  field  quantities.  Because  of  the  usefulness  of  the  FDTD  method  for  many  optical  applications,  recmit 
researchers  have  focused  their  attention  on  the  numerical  evaluation  of  the  linear  and  nonlinear |  ?SSSf 
which  appear  in  one  of  Maxwell’s  equations  (Ampere's  Law)  as  convolution  integrals  An  efficient  evalua 
these  terms  allows  us  to  model  linear  and  nonlinear  dispersive  effects  more  effectively  [4-9]. 

The  designation  of  tenns  as  linear  and  nonlinear  depends  upon  the  form  of  the  mtegrmid 
convolution  integral  that  relates  the  displacement  field  vector,  flfr.il,  to  the  electric  field  vector,  gr.il-  For  bnear 
dispersive,  isotropic  materials,  the  relationship  between  D(t;x)  and  E(t;x)  is  usually  expressed  as 


Dft;x)  =  ece„  E(t;x)+e0  ^  j E( t; x)X(pn(t-T ) dr , 


where  e0  is  the  electric  permittivity  of  free  space,  is  the  medium  permittivity  at  infinite 

is  the  pt h  term  of  the  collection  consisting  of  Pmax  time  dependent,  first-order  electric  susceptibility  functions  where 
Pmax  is  the  maximum  number  of  terms  which  we  choose  to  consider  for  a  partmuiarformuJationof  Eq.  CUM* 
isotropic  materials  that  exhibit  both  linear  and  nonlinear  polarization  properties,  specifically  through  the  first-order 
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(linear)  and  third-order  (nonlinear)  electric  susceptibility  functions,  Xj^ft-r)  and  Xp3>(t, %ti,t2),  respectively,  the 
relationship  between  D(t;x)  and  E(t;x)  can  be  expressed  as 

Dft;x)  =  e0e„  E(t;x)+e0  £  jE(T;x)X(pn(t-r)dr 

p  —> 

+e0  £  J  J  ]mt2;x)[ E(t1;x).E(T;x)]X(3i(t,T,t1,t2  )dt2  dt ,  dr,  (1.2) 

P 

where  Xp3i(t,%tht2)  is  the  pth  term  of  the  four-time  dependent  third-order  susceptibility  function  which  contributes 
to  the  nonlinear  behavior  of  the  material  and  •  is  the  notation  used  for  the  dot  product  of  vectors.  When  Xj3)(t,  %t,,t2) 
is  reduced  to  the  single-time  dependent  susceptibility  function,  xj^ftrh),  by  making  use  of  the  following  Bom- 
Oppenheimer  approximation  [10]: 

X(3)(t,r,t,,t2  )=S(r-t1  )S(t-t2  )[ rf3)(t2  -tj  )+S(t2  -t)  o/03JJ ,  (1.3) 

where  ol3p  is  a  constant  and  S(t)  is  the  Dirac  delta  function,  we  can  show  that  Eq.  (1.2)  reduces  to  an  expression  that 
consists  of  sums  of  convolution  integrals  of  linear  and  nonlinear  terms;  namely, 

D(t;x)  =  e0e„  E(t;x)+ea  jE(r;x)XfpJ,(t-r)dr 

p  -~ 

+60  E(t;xj%  j[E(r;x)*E( r;x )] rf3)(t- r]dr+e0  E(t;x)[ E(t;x)»E(t;x)]^a<3>  .  (1.4) 

p  —  p 

Based  on  the  above  expression,  this  paper  provides  a  general  formulation  of  the  FDTD  method,  which  we  call  the 
piecewise  continuous  recursive  convolution  (PCRC)  approach,  to  evaluate  the  linear  and  nonlinear  convolution 
integrals.  We  investigate  in  particular  the  case  in  which  both  the  first-order  and  third-order  electric  susceptibility 
functions  are  expressed  as  complex  functions  that  contain  complex  constant  coefficients  and  exhibit  exponential 
behavior  in  the  time  domain  as  follows: 

X(J>(t)= Ref  aLP  exp  (-fit)}  U(  t) ,  (1.5) 

and 

X‘jl<t)=Re{aiLexp<^?t)IU(t),  (1.6) 

where  Ref  }  is  used  to  represent  the  real  part  of  a  complex  function,  U(t)  is  the  unit  step  function,  and  ap  ,  apL  ,  yp 
and  ypL  are  complex  constant  coefficients;  superscripts  L  and  NL  are  used  to  distinguish  between  linear  and 
nonlinear  coefficients.  By  making  the  proper  choices  of  complex  constant  coefficients  and  performing  Fourier 
transforms,  we  can  readily  obtain  the  familiar  Debye  and  Lorentz  forms  of  the  complex  permittivity  in  the  frequency 
domain. 

IL  GOVERNING  EQUATIONS  AND  GENERAL  FORMULATION  OF  THE  RECURSIVE 
CONVOLUTION  APPROACH 


In  light  of  Eq.  (1.4),  Maxwell’s  equations  inside  the  dispersive  material  can  be  written  as 
d[p  H( t;x)  ] 

Y*E(t;x)  =  — ~-r  -  , 

dt 

VxH(t;x)=  d-(':-)+aE(t;x), 

0  t 

D(t;x)~eae„  E(t;x)+ec  £ PLp(t;x ) 

p 

E(t;xj£p»L(t;x)+e0  E(t;x)[ E(t;x)*E(t;x) ]^af03p> , 


(2.1) 

(2.2) 

(2.3) 
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Pp(t;x)  =  fE(r;x)X(J>(t~^)dT,  (2'4) 

P^(t;x)=j[E(T;x)*E(T;x)]/p3,(t-T)dT,  (Z5) 

where  H(t;x)  isThe  magnetic  field  vector,  p  is  the  magnetic  permeability,  and  PpL(t;x)  and  « 

related  to  *e  pth  terms  of  linear  and  nonlinear  polarization  field  vectors,  respectively  Us.ng  an  roTO  algonthm 
the  above  equations  can  be  solved  numerically  at  each  time  step  provided  we  can  handle  Pp  (t.x)  and  Pp 
numerically  Therefore,  the  whole  solution  rests  on  the  question  of  how  to  carry  out  the  numencal  eyaluah  f 
P  L(rx)  and  PpNL(t;x)  at  each  successive  time  step.  For  that  reason,  the  rest  of  this  section  is  devoted  to  the  numerical 
formulation  that  treats  PpL(t;x)  and  PpNL(t;x)  based  on  the  recursive  convolution  approach  Diecewise 

To  obtain  second-order  accuracy  in  time  in  evaluating  the  convolution  integrals,  is  token  to  beap  eceunse 
continuous  function  over  the  entire  temporal  integration  range  so  that  ^  changes  linearly  w.th ^  resect .  to  time 

over  a  given  discrete  time  interval  [mAt,  0 m+WL  where  m=0,l . «-/,  with  nAt  being  die  current  ume  stop  [Uj. 

Referring  to  Fig.  1 ,  we  can  express  E(t;x)  in  the  following  form  in  terms  of  the  electric  field  values.  Eg  and  , 
which  afe,  respectively,  evaluated  at  discrete  time  steps  t=mAt  and  t=(m+I)At  and  at  the  same  discrete  spatial 
location  *  =  (i iAxJAy.kAz),  with  Ax,  Ay  and  Az  being  the  spatial  grid  sizes  in  the  *,  y  and  z  directions,  respectively 
{we  use  a  superscript  to  designate  the  discrete  time-step  and  a  subscript  for  the  discrete  spatial  location). 


E(t;x)={  E”  + 


(t-mAt)  }  for  0  <  mAt  <t<(m+l)At<  nAt;  and 


E(t;x)=  0  for  t<0.  (Z6) 

When  Eq.  (2.6)  is  substituted  into  Eq.  (2.4),  we  obtain  after  some  manipulation  the  following  expression  for  discrete 
values  of  Pp(t;x)  at  discrete  time  step  nAt  and  discrete  spatial  location  (iAxJAy.kAz)  [12]. 

PLp  ( nAt;  iAx,  jAy,  kAz)  =  (Pp  ),"*  s  Re{ ( Qp  )"jk  }, 
where  the  discrete  complex  values,  ( Qp)ijk  ,  2TS  defined  as 

<Q.lp>I>  -1  >(<'r  1  <2'8) 

m=0 

with 

(n-m)Al 

(VLpjD)nm=  aLP  \exp(-yLpr)dr, 

(n-m-l)At 

(¥p.i)nm=  —  m\[(n-m)At-z)exp(-YLpT)dT.  (210) 

Similarly,  substituting  Eq.  (2.6)  into  Eq.  (2.5),  we  obtain  after  some  manipulation  the  following  expression  for 
discrete  values  of  PpL(t;x)  at  discrete  time  step  nAt  and  discrete  spatial  location  (lAxjAy.kAz)  l  J- 

pNpL(  nAt;  iAx,  jAy.kAz)}  =  (  PpL)?jk  =  Re((  QpL)?jk  /  * 
where  the  discrete  complex  values,  (QpLhjk  » are  defined  as 

ffl?*  -x tn  -a  >(<*r 

m=0 

het1  -a  -e.’t  xv%r  1 .  <212) 


<  n-mW 

( y/NpL0  rm  =  af  \exp(-yNpLr)dT, 
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NL  , „  ... 

(¥pj)n'm=  j[(n-m)At-T]exp(^LT)dr,  (2.14) 

(a-m-I)A 
„NL  (n-m)Ot 

(¥pL2)n'm=  — P~  \[(n~m)At-t)2  exp(-YpLZ)dT .  (2.15) 

After  some  manipulation  we  can  show  that  the  y/s  defined  in  Eqs.  (2.9),  (2.10),  (2.13),  (2.14)  and  (2.15)  satisfy  the 
following  recursive  relationships  relating  the  next  time  step,  (n+])Ai,  to  the  current  time  step,  nAt : 

(¥Lp,rLm  =exp(- yLpAt)(¥Lpor ,  (2.16) 

(¥LpjrLm  -exp(-  ypAt )( ¥p,)nJn ,  (2.17) 

(VNpLo)n+lm  =exp{-YNpLAt)(¥ffi /■*,  (2.18) 

=exp(- YpLAt)(yrNpL1)R’m,  (2.19) 

(WNp^rhm  =exp(- YpLAt)(¥NpL2)n-m .  (2.20) 

It  is  important  to  mention  that  we  are  able  to  obtain  the  above  recursive  relationships  only  because  both  the  linear 

and  nonlinear  susceptibility  functions  are  expressed  in  the  exponential  forms  shown  in  Eqs.  (1.5)  and  (1.6). 

When  Eqs.  (2.16)  through  (2.20)  are  used  in  Eqs.  (2.8)  and  (2.12),  respectively,  for  the  next  discrete  time  step,  we 
obtain  the  following  expressions  for  (Pphjk*‘  and  ( PpL)ijk*‘  based  on  the  recursive  relationships  obtained  for 
(Qphjk*1  and  (<2/V+/  in  terms  of  V .  (i &V  and  «2/V  Eqs.  (2.16)-(2.20): 

(pLP)?k‘  -  **{(<£>»’  l=Re(  ‘xpi-fatHQ^k  +Ey¥LPJOr]*  +(e*;‘ -E-jk  )(¥LPJrln } 

=  Re{exp(- Y^AtHQ^k  +^(^.0>A°  +fl£#  ~E*jk  )(¥LpJ)W  },  (2.21) 

and 

(P?)$'-R‘(<Q?>$' )  $,+£*  •Ey^)"*1-" 

»<C,r‘-"  +(e£ -£*,  wi?  -i*  xrjr" ) 

+£■, 

+  2  e&t&'-etHvMp  *<SS! -£*>•<$? p-22) 

In  the  above  expressions,  (  0  )/  0  •  (¥£ , )‘  °  >  (¥%L0  )''°  »  ( yr  )y  °  and  (  ¥pL2  )y,°  can  be  evaluated  explicitly 

from  Eqs.  (2.9),  (2.10),  (2.13),  (2.14)  and  (2.15)  in  terms  of  complex  linear  and  nonlinear  susceptibility  coefficients 
ap  ,  apL,  yp  and  ypL  and  At  as  follows: 

<Kjo  =  aP  J  exP  < ~^p  T  )  dT  = ff  t 1 ~  exP  ( ^  H  ’  (123) 

o  0> 

(¥Lpj)I/>s^j(At-r)exp(-YLpT)dT=^E.{l-~^[J-exp(^YLpAt)]},  (2.24) 

&  NL 

(^)l»*af\exp(^pLX)dx~^ -Jl-expf^M)},  (2.25) 

o  <p 

„NL  *  „NL  . 

<W**^\<*-*)exp(^T)dt^{l--^[l-*p(-tfA)]},  (2-26) 

o  fp  fp  m 
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To  demonstrate  how  the  above  terms  are  used  in  the  FDTD  calculations,  we  consider  the  one-dimensional  case 
where  the  propagating  wave  vector,  k,  lies  along  a  major  axis  in  Cartesian  coordinates.  The  analysis  can  be  extended 
easily  to  two  and  three-dimensional  cases,  where  the  EM  wave  is  propagating  in  any  arbitrary  direction,  based  on 
following  the  sample  one-dimensional  formulation  presented  below.  For  our  one-dimensional  case,  we  arbitrarily 
pick  the  k  vector  to  lie  in  the  x  direction,  Dy  and  Ey  field  vectors  in  the  y  direction  and  the  Hz  field  vector  in  the  z 
direction.  When  Eqs.  (2.1)  and  (2.2)  are  differenced  in  both  time  and  space  using  the  usual  Yee  algorithm  [3],  we 
have 


'(Dr1  ),HD?)y 

A: 

Ax 

(2.28) 


( &?$ k~  ( &"$ h 

'(Er  )y-<E?)y' 

Ai 

Ax 

where  subscripts  y  and  z  refer,  respectively,  to  the  y  and  z  components  of  the  field  vectors.  Based  on  expressions 
obtained  in  Eqs.  (2.3),  (2.21)  and  (2.22),  the  left-hand  side  of  Eq.  (2.28)  can  be  expressed  in  terms  of  (E,  )y  and 
(Ein)y  as 


(Dr1  K  -(£>?  h  =  l(Er’  )y  ~(E?  )y  ] 

+  e0'^Re{  [exp(~Yp/ii )- 1][(  QLp)i  ]y  +(E?  )y(WLpJ0  )W  +  )y  -< E )y  l(VLpJ  f°  } 

+  €o  ^{[[(Er1  K  f  ~K  E"  )y  l3  ]  a®  +  /( )y^pi if  *  )  -(#  h  KQ^t 
+(E r'  )yt(E”  )y  ]2(wfh  ),0  +  2(Er'  )y(E*  )yl(Er'  )y -(E ?  )y  l(WpLi )'° 

H Er 1  )y l( E-+1  )y  -<E*  )y  )2( ¥pL2  )w  } •  (2-30) 

When  Eq.  (2.30)  is  substituted  into  Eq.  (2.28),  the  following  algebraic  cubic  equation  results  in  which  we  need  to 
solve  for  (£,"+/)v  in  terms  of  known  quantities  a0,a,,  a2  and  a3  at  t=(n+l)At: 

a0  +  a,  ( E^+ '  )y  +a2  K E"+ '  )>  ? +<*3  MET'  h?  =  °'  (231) 


where  a0 ,  <2/ ,  a2  and  a3  are  given  by 


°°  ^  jtl( )z  ~  (  )zl  +  {E‘}>aAt~  ( E"  }>  €°£~ 

+  e0  [ exp( -  y^At )- 1  ][( Qp )"  ]y-[(E?  )yJ3  d03J  ~  (Ej  )y(Qp  h 

HE?)yl(VLpjo)W-<VLpj),S>l}' 
a,  +€c  ^Re{exp(-r^At)(QNpL)i  +(¥p.i  )>X> 

+[( E*  )y  ]2[(¥pfi  )W-2(¥pj  )W  '(Vpa  )W]}> 

a2  =2  €0  ( E?  )y  [(¥%  )“>  -  (¥%  )’*  ]}, 

a3  Ref  d03J+(  ¥pL2  )W  }  ■ 


(2.32) 

(2.33) 
(2-34) 

(2.35) 


Eq.  (2.31)  can  be  solved  for  (EP*')y  by  using  any  standard  root-finding  numerical  technique.  Hence,  for  complete 
computer  simulation  of  the  electric  field  response  in  nonlinear  dispersive  materials,  we  only  need  to  consu der 
updating  Eqs.  (2.21),  (2.22),  (2.29)  and  (2.31),  respectively,  for  [(Qp L)"  ]>■ >  (Qp  h  *  ^  and  (E‘  ^  at 

each  time  step. 
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For  the  purely  linear  dispersive  case,  a2  and  a3 ,  as  well  as  some  terms  appearing  in  a0  and  dj ,  turn  out  to  be  zero. 
In  this  case  we  can  solve  for  (£'"+;)).  directly  without  having  to  rely  on  the  numerical  root  finding  technique;  this 
case  is  discussed  in  the  literature  on  computational  schemes  for  linear  dispersive  materials  [13-18]. 

m.  EQUIVALNCE  OF  THE  RECURSIVE  CONVOLUTION  APPROACH  TO  THE 
AUXILIARY  DIFFERENTIAL  EQUATION  APPROACH 

An  alternative  technique  for  solving  the  nonlinear  dispersive  problem  is  the  so-called  auxiliary  differential 
equation  approach.  The  auxiliary  differential  equation  approach  is  in  fact  equivalent  to  the  recursive  convolution 
approach  as  we  now  show. 

To  begin,  we  first  substitute  Eq.  (1.5)  into  Eq.  (2.4)  and  Eq.  (1.6)  into  Eq.  (2.5)  and  then  differentiate  these 
integrals  with  respect  to  time  to  obtain  the  following  first-order  differential  equations  for  complex  functions  Qj'(Ux) 
and  QpNL(t;x ): 


dQLp(t;x) 

dt 


+  YLpQLp(t;x)  =  aLpE(t;x), 


(3.1) 


*QNpL(t;x) 

a  t 


+  = af[E(t;x)]2 . 


(3-2) 


From  the  above  equations,  the  linear  and  nonlinear  polarization  vectors,  Pp(t;x)  and  [E(t;x)PpNL(t;x)],  can  be 
obtained  simply  by  taking  the  real  parts  of  QpL(t;x)  and  [E(t;x)QpNL(t;x)],  respectively.  Now,  solving  these  two 
equations  exactly  by  using  integrating  factors  exp( yp  t)  and  exp(ypL  t),  respectively,  and  then  integrating  between 
nAt  and  nAt+At,  we  have 


nit+ii 

Qp( n At  +  At;x ) = exp(  -  jj,  At  )Qp(  nAt;x ) + afc  exp(  -  y^At )  J E(  r;x )  exp[-  y^( nAt-r  )]dz , 

nit 


and 


(3.3) 


ni/+it 

Qf(nAt  +  At;x)=exp(-yfAt)Qf(nAt;x)+aNPLexp(-yNpLAt)  j[E(T;x)  f  exp[- yf(nAi-r )]dr .  (3.4) 

nil 

If  E(t;x)  is  assumed  to  vary  linearly  in  time  between  nAt  and  nAt+At,  as  in  the  piecewise  linear  approximation  of 
Eq.  (2.6),  we  can  substitute  Eq.  (2.6)  into  the  right  hand  side  (i.e.,  the  inhomogeneous  part)  of  Eqs.  (3.3)  and  (3.4)  so 
that  we  obtain  the  following  expressions  for  (Qpf+1  and  (QpNL)n*\  respectively,  at  discrete  time  step  ( nAt+At )  and 
discrete  spatial  location  (iAx,JAy,kAz): 


Qp(  nAt  +  At;iAx,jAy,kAz)=(  Qp$J 

=exp<- +sit<¥itP +($;,'  xKjP  ■  e-s 

and 

Qf(  nAt  +  AttiAx,  jAy,kAz)  =  (  QpL 

=exp(- yNpLAt)(  QNpL  fa  +(£”•* 

+  (3.6) 

In  the  auxiliary  differential  equation  approach,  we  first  use  Eq.  (2.28)  to  update  the  displacement  field  vector, 
Dij".  to  the  next  time  step,  (n+l)At.  Then  the  electric  field  vector  at  time  step  (n+l)At,  £y/+,>  is  updated  from 
Eq.  (2.3)  by  setting  t=(n+l)At : 


lRef<Q.Lp r  l+eo  h g?t£S  -  /K  • 


(3.7) 
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Substituting  Eqs.  (3.5)  and  (3.6)  into  the  above  equation  and  considering  the  same  one-dimensional  case  as  we  have 
treated  in  the  previous  section,  we  obtain  the  following  algebraic  cubic  equation  which  we  need  to  solve  tor  (&,  )y 
in  terms  of  known  quantities  bo,  bt ,  b2  and  b3  at  t=(n+I)At : 

bo+b^Er’  ),+b2  [(E?*1  )yl2+b3  [(£?+'  )yJ3  =  0,  (3-8) 

where  bo,  bh  b2  and  b3  are  given  by 

b0  =-(D?+'  )y  +e0  2  Re{exp(- rpAt)[ ( Qp)"  ]y  +( E?  )y  [(Wp.o  )W  ~(¥p.i )''°  l)  *  5  '  y 

p 

b,  +€„  ^  Rejtxpi -  YpLAi  H QNpL A?  +(Ypj  )l° 

+[( E"  )r  fKvfo  f‘  - 2<Vpj  >'"  *f  Ypj  <310) 

b2.2e,JlRe{(Enyl(Y^)'1’-  <¥%  ?"  J  } .  (3'n) 

<3-,2> 

P 

But  we  know  that  (D?*‘)y  is  obtained  from  Eq.  (2.28)  and  can  be  expressed  as 

(D^  )y  =  (D”  )y  (E*  )y  a  At  ,  (3-13) 

and  it  follows  from  Eq.  (2.3)  that  (D")y  has  the  form 

(£>;),  =€„€„(■  E?  )y+e0J,Re{  [(QLp)”  Jy  }+e0(  E?  %  }+e0  f(  E?  )y  f'LdoJ  ■  (3.14) 

Now,  substituting  Eqs.  (3.13)  and  (3.14)  in  Eq.  (3.9),  we  can  show  that  b0  [Eq.  (3.9)]  is  the  same  as  a0[Eq.  (2  32)]; 
likewise,  we  can  see  that  b,=a,  [Eqs.  (3.10)  and  (2.33)],  b2=a2  [Eqs.  (3.11)  and  (2.34)]  and \  b3=a3  p9S.  (3.12)  and 
(2  35)]  Hence  we  can  conclude  that  the  auxiliary  differential  equation  approach  is  equivalent  to  the  rec 
convolution  approach.  The  only  difference  is  that  the  auxiliary  differential  equation  approach  requires  addiUona^ 
memory  to  be  allocated  for  the  calculation  of  the  displacement  field  vector,  M  is  updated  via  Eq.  (2.28),  at  each 
time  step.  Thus,  in  the  auxiliary  differential  equation  approach,  we  use  Eqs.  (2.28),  (2.29),  (3.5),  (3.6)  and  (3.8)  to 
update,  respectively,  for  (DT‘ A- .  (H^\ .  HQpWh  .  (QpV  and  (E^h  at  each  ume  step  in  order  to 
perform  a  complete  simulation  of  the  electric  field  response  in  nonlinear  dispersive  materials. 

IV.  CONCLUSIONS 

Based  on  the  piecewise  continuous  recursive  convolution  (PCRC)  approach  presented  in  this  paper,  we  can 
predict  the  formation  of  nonlinear  solitary  waves  by  solving  Maxwell’s  equations  directly  for  the  propagation  of 
electromagnetic  waves  in  nonlinear  dispersive  media  that  exhibit  both  instantaneous  Kerr  and  Raman  scattering 
responses  The  PCRC  approach  has  been  shown  to  be  the  same  as  the  auxiliary  differential  equation  approach 
provided  that  the  auxiliary  differential  equations  are  solved  analytically  using  integrating  factors  and  die  same 
piecewise  linear  approximation  is  used  for  E(t;x)  to  integrate  the  inhomogeneous  part  of  auxiliary  differential 
equations.  Because  of  the  piecewise  linear  approximation  we  used  for  the  time  dependent  part  of  the  electric  fiel 
vector,  the  PCRC  results  in  second-order  accuracy  in  time.  As  a  whole,  the  PCRC  approach  retains  all  the 
advantages  of  the  usual  first-order  discrete  recursive  convolution  approach,  such  as  fast  computational  speed  and 
efficient  use  of  the  computer  memory;  however,  the  PCRC  approach  provides  second-order  accuracy  in  time. 

We  pointed  out  that  the  exponential  forms  of  the  linear  and  nonlinear  susceptibility  functions  were  crucial  in 
allowing  us  to  implement  the  recursive  feature  in  our  algorithm.  Also,  we  have  shown  that  the  FDTD  formulation 
for  nonlinear  dispersive  materials  results  in  having  to  solve  an  algebraic  cubic  equation  for  electric  field  values  at 
each  time  step  as  compared  to  just  solving  the  linear  equation  in  the  case  of  linear  dispersive  materials. 

We  believe  that  the  PCRC  approach  is  robust:  the  FDTD  algorithm  can  be  extend  to  two  and  three-dimensional 
cases  with  ease  by  simply  following  the  steps  we  have  presented  in  this  paper  for  the  one-dimensional  case. 
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E(t> 


Tine 

Fig.  1 .  Illustration  of  the  Piecewise  Linear 
Approximation  for  the  Electric  Field 
as  a  Function  of  Discrete  Time  Steps 
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Abstract 

As  digital  systems  reach  ever-increasing  speeds  and  microwave  circuits  become  more 
complex,  it  is  necessary  to  include  the  effect  of  lumped  circuit  elements  in  full-wave  electromagnetic 
simulation.  Applications  where  lumped  circuits  become  important  include  EMI/EMC  of  printed 
circuit  boards,  active  antenna  arrays,  and  integrated  circuit  packaging.  This  paper  provides  an 
overview  of  the  methods  available  to  include  both  passive  and  active  devices  in  FDTD  simulations. 


Introduction 

There  are  several  methods  for  including  lumped  elements  in  the  FDTD  method.  These 
methods  are  necessary  as  digital  and  microwave  circuits  increase  in  speed  and  complexity.  The 
electromagnetic  effects  in  these  systems  as  well  as  the  impact  of  passive  and  active  circuits  must  be 
taken  into  account.  The  methods  used  to  include  lumped  elements  in  FDTD  simulations  vary  in 
complexity  and  robustness,  and  sometimes  the  best  method  to  use  is  not  obvious.  This  paper  provides 
an  overview  of  the  methodology  involved  in  each  technique,  and  the  accompanying  presentation  will 
provide  examples  of  each  method  relevant  to  today’s  design  problems.  Recommendations  for  method 
choice  in  common  applications  will  also  be  presented. 


Extended  FDTD  Method 

One  of  the  basic  techniques  used  to  include  lumped  elements  in  the  FDTD  simulation  space  is 
the  extended  FDTD  method  [1].  This  method  utilizes  the  current  density  term  (Jc)  in  Ampere’s  law  to 
insert  the  current  through  a  lumped  device  (Jl)  into  the  full-wave  FDTD  solution.  Ampere  s  Law  with 
this  term  included  is  stated  as: 

V  x  H  =  £  -I- E+  J  c  +  fL  (!) 

dt 

Where  the  current  density  through  the  device  region  can  be  related  to  the  current  through  the  lumped 
element  by  simply  dividing  the  element  current  by  the  cross-sectional  area  of  the  FDTD  grid  cell.  The 
voltage  across  one  z-oriented  FDTD  cell  is  given  by  the  product  of  Ez  at  that  point  and  the  cell  height 
Az.  The  remaining  step  is  to  write  a  finite-difference  equation  relating  the  current  through  the  lumped 
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element  and  the  voltage  across  it.  Linear  relations  can  be  computed  using  a  simple  update  equation, 
but  nonlinear  devices  such  as  diodes  and  transistors  require  alternative  methods  to  be  used.  As  a 
simple  example,  consider  a  resistor.  The  current  through  this  device  is  related  to  the  voltage  by  V/R. 
Assuming  that  o=0  in  the  cell  where  the  device  is  located,  an  update  equation  for  a  resistor  could  be 
written  as: 


!••!*=  es  i",.*  tyVxu  i-;/42 


At  A zE  .  \-jk 
EAxAy  R 


(2) 


This  method  has  been  extended  into  3-D  [2]  and  to  allow  elements  to  occupy  multiple  FDTD  cells  [3]. 
However,  this  method  has  been  found  to  be  numerically  unstable  [4],  and  a  stable  algorithm  has  been 
proposed  which  uses  a  time-averaged  value  for  the  device  voltage. 


Equivalent  Source  Method 

The  equivalent  source  method  [5]  is  a  technique  which  is  concerned  only  with  the  relationship 
between  current  and  voltage  at  the  device  terminals.  Electric  and  magnetic  fields  in  the  device  region 
are  computed  using  a  separate  subroutine,  so  no  alterations  are  made  to  the  traditional  update 
equations.  In  this  method,  the  current  flowing  through  the  device  region  is  found  by  integrating  the 
magnetic  field  around  a  loop  surrounding  the  port.  This  current  is  then  substituted  into  an  analytic 
expression  for  the  device  voltage  as  a  function  of  the  current.  This  voltage  is  then  impressed  in  the 
device  region  by  setting  the  electric  field  quantities  along  the  port  such  that  the  device  voltage  is 
approximated  by  a  uniformly  distributed  electric  field  across  the  port.  The  magnetic  field  is  then 
updated  using  the  standard  update  equations,  and  the  process  is  repeated.  Alternatively,  the  device 
voltage  can  be  measured,  and  the  magnetic  fields  around  the  port  can  be  set  to  enforce  the  device 
current.  Because  this  method  does  not  involve  directly  manipulating  the  FDTD  update  equations,  it  is 
a  more  straight  forward  task  to  implement  nonlinear  and  multi-port  devices. 

A  derivative  of  the  equivalent  source  method  uses  an  external  circuit  solver  such  as  SPICE  to 
solve  the  I-V  relationships  of  the  lumped  circuit.  Because  this  method  requires  no  knowledge  of  the 
circuit  under  test,  including  linear  and  nonlinear  behavior,  it  is  well-suited  for  implementation  in  a 
generalized  software  package.  There  are  two  dual  techniques  for  interfacing  the  distributed  FDTD,  the 
current  source  [6]  and  voltage  source  [7]  methods.  These  methods  take  into  account  the  effects  of  the 
entire  FDTD  grid  on  the  region  where  the  lumped  element  resides  (port)  by  manipulating  Maxwell’s 
equations  in  the  region  around  the  port. 

The  current  source  method  is  based  on  Ampere’s  law,  which  can  be  stated  in  integral  form  as: 

•  dl  =  •  ds  +  \  3-e  E  •  ds  (3) 

which  can  be  written  in  terms  of  circuit  quantities. 

/  =  I(V  )  +  C  ±V  (4) 

I  is  the  total  current  flowing  through  the  port,  C  is  the  space  capacitance  of  the  FDTD  grid,  V  is  the 
voltage  across  the  device,  and  I(V)  is  the  device  current  as  a  function  of  device  voltage.  This  can  be 
seen  as  an  expression  of  Kirchoff  s  current  law  for  the  circuit  shown  in  figure  1. 


Lumped 

Device 


Figure  1.  Norton  Equivalent  Circuit  for  SPICE  Interface 


I(V) 


The  current  source  I  is  set  from  FDTD  by  performing  a  closed-loop  integral  of  the  magnetic  field 
around  the  port,  and  the  FDTD  space  capacitance  C  is  found  from  the  area  of  the  port,  the  permittivity 
of  the  material  in  the  port  region,  and  the  height  of  the  port  using  the  relation  for  a  parallel-plate 
capacitor.  The  voltage  across  the  terminals  of  the  lumped  device  is  then  found  from  SPICE,  and  the 
electric  field  values  in  the  port  region  are  set  such  that  the  integral  of  E-dl  across  the  port  is  equal  to 
this  voltage. 


The  voltage-source  method  is  based  on  Faraday’s  law,  which  can  be  stated  in  integral  form. 


As  was  done  for  the 
quantities. 


§>E  •  dl  = - J  fi  H  •  ds 

c  dt  s 

current-source  method,  this  equation  can 


(5) 

be  rewritten  in  terms  of  circuit 


v'« =  Ljr' +  v'"  (6> 

Where  V,ot  is  the  voltage  across  the  FDTD  port  region,  L  is  the  space-inductance  of  the  FDTD  grid,  I 
is  the  current  flowing  through  the  port,  and  Vdev  is  the  voltage  across  the  lumped  device.  Similarly  to 
the  current-source  method,  this  can  be  seen  as  an  expression  of  Kirchoff  s  voltage  law  for  the  series 


circuit  shown  in  figure  2. 


I 
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Figure  2.  Thevenin  Equivalent  Circuit  for  SPICE  Interface 


For  a  port  occupying  a  single  FDTD  electric  field  point,  the  relations  for  Vtot  and  L  are  given  by  [7,8]. 


S-Parameter  Method 

Another  approach  to  lumped  devices  is.  to  model  them  based  on  the  scattering  matrix 
representation  of  the  circuit.  This  method  would  allow  network-analyzer  measurements  of  fabricated 
structures  to  be  easily  included  in  FDTD  simulations,  and  will  be  very  useful  in  microwave  circuit 
design  where  device  behavior  is  often  given  term  of  s-parameters.  A  similar  technique  [9]  has  been 
used  to  represent  electromagnetic  simulation  results  in  lumped  circuit  simulators.  This  technique  has 
been  applied  to  transmission  line  termination  in  FDTD  by  specifying  the  reflection  coefficient  (Sn) 
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presented  by  the  termination.  This  method  relates  the  voltage  and  current  at  the  device  terminals 
through  the  reflection  coefficient  in  a  manner  similar  to  the  equivalent  source  method.  This  relation  is 
given  by 

v”"'>  =  suv'-'"  +  zref(su+i)r  (7) 

Where  is  the  characteristic  impedance  of  the  input  transmission  line.  This  method  is  currently 
limited  to  one-port  devices  with  real,  frequency-invariant  reflection  coefficients,  but  work  is  in 
progress  to  extend  this  method  to  include  multi-port  devices  with  complex  s-parameters  that  can  vary 
with  frequency. 


Conclusions 

This  paper  provides  an  overview  of  the  methodology  involved  in  each  technique  for  including 
lumped  elements  in  FDTD  simulations.  The  accompanying  conference  presentation  will  compare 
these  methods  with  examples  relevant  to  today’s  design  problems.  Common  applications  and 
recommendations  for  choosing  a  method  for  including  lumped  elements  will  also  be  discussed. 
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Abstract 

A  new  method  for  interfacing  standard  finite- 
difference  time-domain  grids  with  different 
node  spacings  at  material  boundaries  is  pre¬ 
sented.  This  method  greatly  enhances  the  com¬ 
putational  efficiency  when  modeling  structures 
that  contain  homogeneous  regions  with  large 
differences  in  refractive  index.  This  situation 
is  frequently  encountered  when  modeling  light 
sensitive  semiconductor  devices  on  which  light 
is  incident  from  vacuum.  In  addition  to  the  su¬ 
perior  computational  performance,  more  accu¬ 
rate  results  can  be  obtained  in  comparison  to 
standard  FDTD  due  to  decreased  numerical  dis¬ 
persion.  The  stability  of  the  method  is  dis¬ 
cussed  and  demonstrated  by  means  of  test  exam¬ 
ple  problems. 

1  Introduction 

FDTD  methods  [1]  play  an  ever  increasing  role  in 
modeling  all  kinds  of  optical  structures.  In  recent 
publications  simulation  results  for  light  propaga¬ 
tion  in  photonic  crystals  [2],  waveguides  [3],  (VC- 
SEL)  laser  cavities  [4],  integrated  optics  devices 
[5],  liquid  crystal  displays  [6],  photodetectors  [7], 
etc.  have  been  presented. 

One  major  drawback  of  standard  FDTD  is  that 
it  operates  on  a  so  called  rectilinear  tensor  prod¬ 
uct  grid  (RTPG).  By  this  we  mean  that  the  position 
of  all  the  nodes  is  determined  by  a  1  x  nj-vector 
containing  the  node  spacings  for  each  of  the  or¬ 
thogonal  coordinate  axes  i  (where  i  €  {1,2}  for 
2D  and  i  <=  {1,2,3}  for  3D  are  corresponding  to 
the  x-,  y-  and  z-directions  respectively).  In  the 
following,  we  refer  to  a  grid  as  being  uniform  if 
in  each  vector  all  the  elements  are  equal.  To  get 
sufficiently  accurate  results  from  an  FDTD  simu¬ 
lation,  the  node  spacing  has  to  be  chosen  suffi¬ 


ciently  small.  The  maximum  allowable  node  spac¬ 
ing  is  determined  by  a  couple  of  factors:  Firstly, 
the  grid  spacing  d  within  a  homogeneous  material 
region  needs  to  be  much  smaller  than  the  wave¬ 
length  A  in  this  material.  Generally,  d  <  A/10  is 
a  minimum  requirement  for  reliable  results.  For 
very  large  simulation  domains  (in  wavelengths), 
finer  grid  spacing  is  necessary  to  keep  the  er¬ 
rors  due  to  numerical  dispersion  below  a  tolera¬ 
ble  limit.  In  addition,  the  grid  spacing  needs  to 


Figure  1:  Nonuniform  rectilinear  tensor  product  grid 
for  a  region  of  high  refractive  index  ni  surrounded  by 
lower  refractive  index  no  <  nj. 

be  fine  enough  to  resolve  all  geometrical  features 
with  sufficient  accuracy.  If  a  part  of  the  simulation 
domain  bounded  by  [xmin,®max]/  [j/min,j/max]  (and 
[zmin,  zmax]  in  3D)  is  filled  with  a  material  of  high 
refractive  index  or  a  structure  containing  fine  geo¬ 
metrical  detail,  node  spacing  in  all  these  intervals 
has  to  be  set  accordingly.  In  general,  this  results 
in  fine  node  spacing  in  regions  of  the  simulation 
domain  where  it  would  not  be  required  according 
to  the  former  criteria.  This  can  lead  to  significant 
overheads  in  memory  requirements.  The  situation 
is  demonstrated  in  Fig.  1  for  a  nonuniform  grid. 

To  overcome  the  problems  arising  a  number 
of  local  subgridding  schemes  have  recently  been 
published  in  the  electromagnetics  literature  [8,  9, 
10,  11].  There,  the  need  to  use  subgridding  fre- 
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quently  stems  from  the  presence  of  fine  geometri¬ 
cal  detail.  The  algorithms  dealing  with  this  prob¬ 
lem  allow  to  mesh  a  subregion  of  the  simulation 
domain  with  a  local  grid  with  finer  node  spac¬ 
ing  than  in  the  surrounding  regions.  Updating  of 
this  local  grid  is  done  at  a  smaller  time  step  than 
the  one  used  for  the  coarse  grid.  Interpolation  be¬ 
tween  the  fine  and  coarse  grids  is  therefore  done 
in  both  time  and  space. 


When  simulating  light  sensitive  semiconduc¬ 
tor  devices,  the  standard  situation  is  somewhat 
different.  In  this  case,  the  required  node  spac¬ 
ing  is  often  determined  by  the  wavelength  and 
the  corresponding  accuracy  criterion  mentioned 
above.  Geometrical  features  are  in  many  cases 
large  enough  to  be  sufficiently  well  represented 
using  a  grid  spacing  of  d  <  A/10,  especially  with 
nonuniform  grids.  Therefore,  they  impose  no  ad¬ 
ditional  limits  on  the  node  spacing.  However, 
large  differences  in  refractive  index  normally  im¬ 
pose  strongly  varying  upper  limits  on  the  node 
spacing  in  the  various  regions.  Since,  on  the  other 
hand,  the  maximum  time  step  which  allows  stable 
operation  of  the  FDTD  algorithm  is  governed  by 
the  "optically"  smallest  cell  (where  "optical  size" 
is  given  by  tire  product  of  dimension  and  refrac¬ 
tive  index),  the  fine  resolution  in  the  low  refractive 
index  inherent  to  standard  FDTD  on  RTPGs  leads 
to  an  unnecessarily  small  overall  time  step.  These 
two  effects  considerably  limit  the  size  of  structures 
that  can  be  simulated  within  reasonable  computa¬ 
tion  time  for  given  memory  resources. 

In  this  article,  we  present  an  interpola¬ 
tion/integration  scheme  which  interfaces  FDTD 
grids  with  different  node  spacings  along  non- 
planar  material  interfaces,  thus  allowing  to  adapt 
the  node  spacing  in  homogeneous  domains  to  the 
corresponding  refractive  index.  This  way,  the 
"optical"  node  spacing  can  be  chosen  constant 
throughout  the  entire  simulation  domain,  lead¬ 
ing  to  a  common  upper  limit  for  the  stable  time 
step.  Therefore,  the  same  time  step  can  be  used  for 
both  grids,  i.e.  no  temporal  sub-sampling  has  to 
be  carried  out  in  the  high  resolution  domain.  The 
scheme  presented  is  an  extension  of  a  formulation 
for  grid  interfacing  at  planar  material  boundaries 
[12]. 


2  Derivation  and  Implementation 

In  this  section,  we  derive  an  algorithm  for  cou¬ 
pling  two  FDTD  grids  with  different  node  spacing 
along  a  material  interface  in  2D. 
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Figure  2:  Non-planar  FDTD  grid  interface. 
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Figure  3:  Interpolation  of  Ez  field  components 

Throughout  the  rest  of  this  article  we  assume 
that  we  have  two  homogeneous  regions  k  (k  = 
1, 2)  with  refractive  indices  n\  <  «2,  whose  com¬ 
mon  boundary  can  be  represented  by  a  staircase 
line.  Both  regions  are  supposed  to  have  the  same 
magnetic  permeability  /z.  It  is  assumed  that  Re¬ 
gion  k  is  meshed  by  a  rectilinear  FDTD  grid.  Grid 
k.  In  what  follows,  superscript  indices  will  be 
used  to  denote  the  grid  the  components  and  co¬ 
ordinates  belong  to.  The  grids  have  to  be  set  up 
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in  such  a  way  that  they  have  If -components  (for 
TM  polarization)  along  the  material  interface  to 
be  modeled.  Node  spacing  may  be  nonuniform 
in  both  grids,  so  the  latter  requirement  is  not  an 
overly  strict  limitation  as  long  as  minimum  fea¬ 
ture  sizes  are  of  the  same  order  of  magnitude  as 
the  wavelength  of  the  excitation  signal.  Fig.  2 
shows  a  detail  of  a  non-planar  interface  region  be¬ 
tween  two  grids  which  contains  all  the  possible 
interpolation  scenarios  that  may  appear  in  2D  and 
shows  all  the  components  that  will  be  required 
for  the  interfacing  process  are.  Note  that  the  ac¬ 
tual  location  of  the  vectors  in  the  x-y- plane  is  in 
their  centers,  not  at  their  feet  points,  for  reasons 
of  drawing.  As  indicated  in  the  Figure,  grids  have 
to  be  truncated  in  such  a  way  that  for  each  H 2- 
component  on  the  material  boundary  there  is  at 
least  one  adjacent  E2 -component  located  within 
Region  1.  All  Grid  2  field  components  within  Re¬ 
gion  2  and  all  i?2-field  components  on  the  region 
boundary  are  updated  by  the  standard  Yee  [13] 
algorithm,  also  all  Grid  1  components  within  Re¬ 
gion  1.  We  will  refer  to  these  components  as  reg¬ 
ular  grid  components,  they  are  drawn  in  black  in 
Fig.  2.  All  the  remaining  field  components  will  be 
referred  to  as  irregular;  the  components  depicted 
in  grey  in  Fig.  2  are  all  irregular. 

To  explain  our  interpolation  scheme,  we  shortly 
review  the  Yee  scheme  for  time  stepping  in  stan¬ 
dard  FDTD.  Starting  from  Maxwell's  equations  in 
integral  form 

[  EdS  =  -  Hds  --  l  JdS  (1) 
dt  Js  e  Jos  e  Js 

£  f  HdS  =  --  Eds  (2) 

at  Js  V  Jds 

one  can  derive  the  standard  FDTD  update  equa¬ 
tions  for  the  electric  field 


+Cfc(i?ylij  H'j I i  —  2 . , 


(3) 


,«+l/2 


with  Ca  =  [1  -  a*Af/(2e*)][l  +  a*Af/(2e*)]_1, 
cb  =  At/{ekdk)[l  +  (l  +  akAt)/(2ek)]~1  and  for  the 
magnetic  field 

**  \ijl/2  =  H*\ iJ1/2  +  A  fck* "  E>ki+ 0 


Hy^2  =  *v\ul/2  +  ~  E*kj)\ 

(4) 

where  we  assumed  for  simplicity  of  notation  that 
we  have  uniform  quadratic  grids,  i.e.  equal  node 
spacings  in  x-  and  y-directions.  In  addition,  we 
restrict  our  attention  to  TM  polarization,  i.e.  the 
electric  field  in  ^-direction  and  the  magnetic  field 
components  in  the  x-  and  y-directions.  General¬ 
ization  to  both  TE  polarization  and  nonuniform 
grids  is  straightforward;  in  fact,  what  follows  is 
not  influenced  by  whether  the  involved  grids  are 
uniform  or  not. 

When  updating  the  H2 -field  components  on  the 
material  boundary  according  to  (4),  appropriate 
values  have  to  be  found  for  the  E2 -nodes  in  Re¬ 
gion  1  (i.e  the  grayed  ones  in  Fig.  3).  To  com¬ 
pute  these  values,  three  surrounding  regular  E* - 
components  from  both  grids  are  chosen  for  each 
node.  Let  A(k,i,j)  denote  the  rectangle  contain¬ 
ing  all  points  (x,y)  with  x^  <  x  <  x*+1^  and 
Vi,j  <  y  <  Vij+v  where  (xf^,y^)  is  the  coordinate 
of  the  Grid  k  node  The  value  for  E2\{  j  is 

then  determined  as  follows: 

1.  Find  m,n  so  that  (x?j,y?j)  €  A(l,m,n).  De¬ 
termine  the  regular  components  in  {E\  |  , 

Ez\m+l,n'  Ez  lm,n-t-l'  Ez  lm+l,n+l}‘  ^  three 
such  components  are  found,  select  the  two  di¬ 
agonally  opposing  one  another. 

2.  Find  the  regular  components  in  {E2 |t-_j  -, 
Ez\ij-i'  Ez\i+ij'  Ez\iJ+ib  i-e-  the  direct 
neighbors  of  E2  |2  J-. 

3.  If  only  one  regular  component  in  each  grid 
was  found  in  the  above  steps,  proceed  as 
follows:  Find  the  regular  direct  members  of 
the  regular  Grid  2  component.  For  each  of 
these,  form  a  triangle  with  the  two  compo¬ 
nents  found  in  steps  1.  and  2.  Select  the  com¬ 
ponent  which  forms  the  triangle  that  contains 

The  #2 -value  at  (x2j,yfj)  can  now  be  computed 
by  fitting  an  interpolation  plane  through  the  val¬ 
ues  at  the  three  surrounding  points  and  determin¬ 
ing  the  value  at  the  above  position.  For  the  scheme 
to  work  properly,  one  has  to  ensure  that  the  E- 
field  components  in  both  grids  have  been  updated 
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before  the  interpolation  is  carried  out.  Fig.  3  il¬ 
lustrates  the  situations  that  may  appear.  The  val¬ 
ues  for  the  grayed  E* -components  are  computed 
by  interpolating  between  the  values  of  the  com¬ 
ponents  connected  to  them  by  the  dashed  lines 
(for  reasons  of  clarity,  the  lines  are  only  drawn  for 
some  representative  fine  grid  components).  Note 
that  with  the  above  algorithm  problems  in  find¬ 
ing  appropriate  interpolation  triangles  may  oc¬ 
cur  if  cells  with  large  aspect  ratios  are  present  at 
the  boundary,  especially  if  dl/d2  »  3.  How¬ 
ever,  it  shall  be  pointed  out  that  the  idea  of  the 
index  adaptive  gridding  scheme  presented  is  to 
have  as  uniform  and  quadratic  grids  as  possible 
within  homogeneous  regions  with  appropriate  in¬ 
terfaces  between.  This  way,  optimum  efficiency 
is  achieved  (see  below).  If  large  aspect  ratios  can 
nevertheless  not  be  avoided,  the  algorithm  can 
easily  be  extended  to  cope  with  the  problems  aris¬ 
ing. 

To  update  the  I?1 -field  nodes  adjacent  to  H1- 
field  components  located  on  the  material  bound¬ 
ary,  the  values  for  these  latter  components  have  to 
be  determined  by  averaging  the  values  of  the  ap¬ 
propriate  H 2  components  on  the  boundary.  These 
components  are  found  as  follows:  Starting  from 
an  Enfield  node  which  was  required  for  the  in¬ 
terpolation  of  some  irregular  E2 -field  values,  any 
#2-component  on  the  boundary  adjacent  to  one 
of  these  E2 -field  values  will  be  taken  into  account 
in  the  averaging  process  for  any  #3 -component 
on  the  boundary  adjacent  to  the  E1 -field  node 
started  with,  where  either  v  =  x  or  v  =  x.  In 
the  averaging  process,  every  E^-component  will 
be  weighted  with  exactly  the  weight  the  adjacent 
E1 -field  node  was  weighted  with  when  interpo¬ 
lating  the  values  for  the  irregular  E2 -field  value. 
In  numerical  experiments  carried  out,  it  was  no¬ 
ticed  that  this  averaging  procedure  is  crucial  for 
guaranteeing  the  late-time  stability  of  the  interfac¬ 
ing  scheme.  If  the  averaging  is  carried  out  in  such 
a  way  that  each  H2 -component  on  the  bound¬ 
ary  contribute  only  to  the  value  of  its  closest  Hi- 
component,  instabilities  are  observed  after  a  few 
10000  time  steps  for  standard  examples. 

For  quadratic  uniform  grids,  optimum  perfor¬ 
mance  is  achieved  when  d\  =  d2«2/^i  is  chosen, 
resulting  in  a  common  Courant  limit  for  the  stable 
time  step  At  for  both  grids,  since  stability  consid¬ 


erations  require  that 


As  was  pointed  out  in  [14],  this  time  step  results 
in  the  smallest  numerical  dispersion  error  possi¬ 
ble  within  a  homogeneous  region  with  refractive 
index  n*  and  a  resulting  phase  speed  of  c  =  co/n*. 
As  a  result,  the  subgridding  scheme  presented 
above  not  only  allows  more  efficient  computation, 
but  also  higher  modeling  accuracy  in  the  presence 
of  large  refractive  index  variations. 

3  Numerical  Results 

3.1  Quasi-ID  Test  Example 

Various  tests  were  carried  out  to  evaluate  the  ac¬ 
curacy  of  the  formulation  presented.  First,  we 
considered  a  quasi  ID  test  example  of  a  Gaus¬ 
sian  pulse  plane  wave  incident  onto  a  planar  ma¬ 
terial  interfaces.  For  this  purpose,  we  used  uni¬ 
form  quadratic  Ik  x  grids  with  l\  =  2,  mi  = 
100,  h  =  6,  m2  =  130  and  d2  =  di/3.  Peri¬ 
odic  boundary  conditions  were  applied  at  i  =  0 
and  i  =  1*.  A  Gaussian  pulse  with  a  maximum 
frequency  of  about  fmax  —  co/(10di)  was  used 
as  excitation,  giving  a  minimum  wavelength  of 
about  10di  within  Region  1.  m  =  1  was  as¬ 
sumed  for  Region  1,  several  calculations  were 
done  for  Region  2  filled  with  materials  having 
different  refractive  indices  n2.  Time  steps  were 
At  =  min  ^d2«2/(\/5co),di/(\/2co)j.  There- 
fleeted  fields  were  computed  by  carrying  out  a  ref¬ 
erence  simulation  with  no  material  and  grid  inter¬ 
face  present.  Frequency  dependent  reflection  co¬ 
efficients  were  then  calculated  by  Fourier  trans¬ 
form  and  compared  to  analytical  results.  Fig.  4 
(left)  shows  the  errors  in  numerically  computed 
reflection  coefficients  tfdtd  -  ’’analytical  for  vari¬ 
ous  refractive  indices  n2.  The  results  show  ex¬ 
cellent  agreement  between  analytical  and  numer¬ 
ical  results  when  n2  =  3  =  d\jd2  *  ni.  For  re¬ 
fractive  indices  n2  ^  3,  the  errors  increased  with 
rc2  —  3.  Computations  were  also  carried  out  us¬ 
ing  a  number  of  smaller  time  steps  without  any 
significant  change  in  results.  For  comparison,  we 
also  examined  reflection  at  a  material  interface  us¬ 
ing  a  uniform  fine  grid  with  node  spacing  d  =  d2 
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Figure  4:  Errors  in  numerically  computed 

ri2  -  1.5/ - 7*2  =  2, - 712  =3/  —  712  =  4. 

grid  with  fine  mesh.  Note  that  d  =  d\  /3. 


reflection  coefficients  tfdtd  —  ^analytical:  —  no  —  1,  •  •  •  «2  —  1-0201, 
Left:  Material  boundary  at  grid  resolution  interface.  Right:  Uniform 


in  both  regions.  Here,  the  time  step  used  was 
At  =  d2/(%/2c0).  The  results  are  also  shown  in 
Fig.  4.  In  this  case,  there  is  obviously  no  conceiv¬ 
able  error  for  n2  =  1.  With  increasing  difference 
in  refractive  index  between  the  two  regions,  the 
error  in  the  reflection  coefficient  increases.  For 
the  comparatively  high  maximum  error  of  0.03 
for  n2  =  4,  one  has  to  be  aware  that  at  the  cor¬ 
responding  wavelength  A2  in  Region  2,  one  has 
\2/d  =  7.5  <  10,  i.e.  a  value  too  small  for  ob¬ 
taining  high  accuracy  results. 

Fig.  4  demonstrates  that  the  most  accurate  re¬ 
sults  are  found  when  the  "optical"  node  spacing 
is  equal  in  regions  with  different  refractive  index. 
The  more  unbalanced  the  "optical"  node  spacing, 
the  bigger  the  errors  that  are  obtained  in  the  cal¬ 
culation.  This  can  be  thought  of  as  an  extension 
of  results  for  simulation  with  nonuniform  grids, 
where  it  was  found  that  the  spacings  d±  between 
adjacent  nodes  need,  as  a  rule  of  thumb,  to  obey 
0.5 dk  <  di+ 1  <  2 di  {i  e  {1,1}  and  equivalent  in 
the  other  coordinate  directions)  to  get  sufficiently 
accurate  results  [1]. 

To  assess  the  usefulness  of  the  presented  formu¬ 
lation  for  simulation  of  actual  2D  problems,  we 
first  modeled  the  light  propagation  within  a  test 
structure  on  both  a  uniform  grid  and  two  adjacent 
grids  with  planar  interfacing  and  compared  the 
results.  This  time,  we  had  d2  =  di/4,  l\  =  m i  = 
100,  h  =  400,  m2  =  200.  Region  2  was  assumed  to 
be  filled  with  a  material  with  n2  =  4,  e.g.  silicon. 
Part  of  Region  1  is  filled  with  a  structure  consist¬ 
ing  of  a  material  with  n  =  1.5,  e.g.  silica,  which 
extends  30  nodes  in  x-direction  and  60  nodes  in  in¬ 


direction,  throughout  the  rest  of  Region  1  we  have 
m  —  1.  Similar  situations  are  often  encountered 
in  photodetector  or  sensor  modelling,  where  some 
structure  (e.g.  a  microlens  or  grating)  is  supposed 
to  focus/ guide  light  into  a  semiconductor  sub¬ 
strate,  which  is  responsible  for  the  electronic  part 
of  light  detection.  Periodic  boundary  conditions 
were  applied  at  j  =  0  and  j  =  m^-Fig.  5  shows  the 
intensity  distribution  for  reflection /transmission 
of  a  Gaussian  pulse  plane  wave  with  a  maximum 
frequency  of  about  /max  =  c0/(25di)  incident  on 
the  structure /interface.  For  the  graph  on  the  left 
side,  computation  was  carried  out  on  a  uniform 
mesh  with  d  =  d2,  for  the  graph  on  the  right  side, 
the  algorithm  presented  was  used.  The  results  are 
in  very  good  agreement. 

To  compare  the  computational  effort  required 
for  the  two  different  models,  we  assume  that  for 
the  case  with  mesh  interfacing,  about  three  quar¬ 
ters  of  the  computational  domain  are  filled  with 
the  coarse  grid.  The  total  number  of  cells  is  there¬ 
fore  7/16  of  the  total  number  of  cells  in  the  uni¬ 
form  fine  grid  case,  resulting  in  a  likewise  de¬ 
crease  in  memory  requirements.  The  number  of 
update  steps  necessary  to  reach  a  given  state  re¬ 
duces  to  1  /4  of  that  for  the  uniform  grid.  Together, 
this  results  in  a  decrease  in  computation  time  by  a 
factor  of  about  8. 

3.2  2D  Non-planar  Interface 

To  assess  the  usefulness  of  the  presented  formu¬ 
lation  for  simulation  of  actual  2D  problems,  we 
modeled  the  light  propagation  within  a  test  struc- 
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Figure  5:  Planar  grid  interface:  Intensity  distribution  for  a  Gaussian  pulse  plane  incident  on  a  structure  on  top  of 
a  planar  material  interface.  The  high  refractive  index  region  is  to  the  right  of  the  thick  white  line,  the  plane  wave 
is  incident  from  the  left  (i.e.  traveling  in  +£-direction.  Results  for  a  uniform  fine  grid  (left)  are  compared  to  results 
obtained  by  using  a  coarser  grid  in  Region  1  (right).  Contour  lines  are  at  the  same  intensity  levels  in  both  plots. 


ture  on  both  a  uniform  grid  and  two  adjacent 
grids  with  interfacing  and  compared  the  results. 
For  the  refractive  indices,  it  was  assumed  that 
7*2  =  3ni.  Uniform  node  spacing  was  chosen 
in  both  grids  with  d2  —  di/Z.  The  time  depen¬ 
dence  of  the  incoming  plane  wave  was  of  the  form 
sin(wt)  exp(-f2/a2),  with  u  and  a  chosen  in  such  a 
way  that  the  maximum  of  the  wavelength  spec¬ 
trum  is  at  A  =  20di  and  the  minimum  wave¬ 
length  at  A  ~  10di  (both  in  Region  1).  Periodic 
boundary  conditions  were  applied  at  j  =  0  and 
j  =  m*.  The  structure  to  be  considered  consists  of 
a  wedge  shaped  non-planar  interface.  The  wedge 
is  105c?2  in  z -direction  and  59^2  in  y-direction. 
Fig.  6  shows  the  intensity  distribution  for  reflec¬ 
tion/transmission  of  the  incident  wave  at  the  in¬ 
terface.  Fig.  7  shows  a  cut  through  the  plots  from 
Fig.  6  along  the  ^-direction  at  y-nodes  j  =  20  and 
j  =  90.  All  the  results  are  in  good  agreement.  As 
for  the  deviations  conceivable,  one  has  to  bear  in 
mind  that  in  the  quasi-ID  example  more  accurate 
results  were  obtained  when  simulating  with  the 
interfaced  grids. 

3.3  Stability 

As  was  mentioned  above,  an  appropriate  combi¬ 
nation  of  interpolation  and  averaging  is  necessary 
to  ensure  the  late-time  stability  of  the  method  pre- 


Figure  8:  Demonstration  of  numerical  stability:  The 
field  energy  in  a  closed  resonator  with  a  subgrid  region 
is  shown  for  100000  time  steps.  No  instabilities  can  be 
observed.  If  instabilities  were  present,  an  increase  of 
field  energy  with  time  would  result. 


sented.  In  [11],  conditions  for  the  stability  of  sub- 
gridding  schemes  are  given,  which,  however,  are 
only  sufficient  but  not  necessary.  A  detailed  eigen¬ 
value  analysis  of  the  updating  scheme  presented 
would  go  beyond  the  scope  of  this  article.  We 
therefore  restrict  to  demonstrating  the  numerical 
stability  of  the  method  by  presenting  simulation 
results  for  a  suitable  test  structure.  We  used  a  res¬ 
onator  that  is  terminated  by  perfectly  conducting 
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Figure  6:  Non-planar  grid  interface:  Electrical  field  for  reflection/transmission  of  a  pbne 

region  interface^ (white  line),  computed  on  a  uniform  fine  mesh  (top)  and  on  two  grids  with  different  node  spacing 

interfaced  at  the  boundary.  Contour  lines  are  at  equal  levels  in  both  plots. 


walls  on  all  four  sides.  The  resonator  is  assumed 
to  be  filled  with  vacuum  (ni  =  1)  and  meshed  by 
Grid  1  with  dimensions  h  =  mi  =  60.  An  8  x  8- 
node  region  located  at  i,j  =  (10,10)  contains  a 
material  with  refractive  index  n2  and  is  meshed 
by  Grid  2  with  d2  =  d\jn2.  Simulations  were  car¬ 
ried  out  for  n2  =  3  and  n2  =  5.  In  both  cases,  the 
common  Courant  limit  given  by  Eq.  (5)  was  used 
to  time  advance  both  grids.  The  electromagnetic 
field  was  introduced  by  a  dipole  excitation,  with 
the  same  time  dependence  as  in  the  previous  sec¬ 
tion,  i.e.  with  a  minimum  wavelength  of  A  ~  10di 
in  Region  1.  Fig.  8  shows  the  electromagnetic  field 
energy  in  the  resonator,  obtained  by  numerically 


integrating  the  energy  density  (eE2+fiH2)/2  over 
both  regions,  for  the  n2  =  5  case.  No  increase 
or  decrease  in  the  field  energy  can  be  observed. 
If  stability  were  violated,  this  would  result  in  an 
increase  of  field  energy  with  time.  The  small  os¬ 
cillations  are  of  numerical  nature,  they  have  also 
been  observed  for  standard  FDTD  on  a  single  grid. 
Equivalent  results  were  found  for  the  n2  =  3  case. 


4  Conclusions 

A  new  algorithm  for  refractive  index  adaptive 
gridding  in  standard  FDTD  has  been  presented. 
This  algorithm  can  greatly  reduce  the  computa- 
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Figure  7:  Cut  through  the  plots  from  Fig.  6  along  the  x-direction  at  y-nodes  j  =  20  (left)  and  j  —  90  (right).  The 
solid  line  corresponds  to  the  solution  obtained  using  two  interfaced  grids  with  different  node  spacing.  The  dotted 
line  was  found  by  using  a  uniform  grid  with  fine  resolution. 


tional  effort  when  modelling  structures  with  ho¬ 
mogeneous  regions  with  strongly  differing  refrac¬ 
tive  indices,  which  are  frequently  encountered 
when  simulating  light  sensitive  semiconductor 
devices.  In  addition,  when  choosing  appropriate 
node  spacing,  the  algorithm  gives  more  accurate 
results  for  reflection  at  planar  material  interfaces 
than  standard  FDTD.  This  was  found  by  compar¬ 
ison  to  analytical  results.  The  numerical  stability 
of  the  scheme  presented  has  been  demonstrated 
by  means  of  test  examples.  The  method  has  re¬ 
cently  been  extended  to  3D,  however,  optimum 
implementation  and  stability  issues  are  still  under 
research. 
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Abstract 

Printed  small  antennas  are  becoming  the  main  subject  of  investigation  for  improve 
performance  of  personal  wireless  communication  systems.  In  recent  presentations  a  new  type  ot 
printed  meander  line  antenna  is  introduced.  In  this  paper,  the  analysis  of  this  type  of  antenna  is 
performed  using  the  finite  difference  time  domain  technique.  Empirical  design  equations  are 
obtained  for  the  resonant  (operating)  frequency  and  the  corresponding  input  impedance  Sample 
of  the  new  designs  provide  antennas  with  good  matching  impedance  to  a  50  fi  system  and  a  dual 
band  operation. 


Introduction 


With  the  advancements  of  modem  integrated  circuit  technologies,  personal 
communication  systems  (PCS)  are  the  fast  growing  sections  of  the  present  telecommunication 
industry.  Modem  designs  of  wireless  personal  communication  systems  are  featured  in  light 
weigh,  small  size,  high  frequency  operation,  and  high  transmission  efficiency.  Mobile  antenna 
design  is  one  of  the  major  tasks  in  PCS  designs.  In  addition  to  the  general  features  needed  for 
PCS  antennas  for  mobile  personal  communications,  there  is  also  a  requirement  that  the  antenna 
be  easily  integrated  with  the  interior  circuitry  of  the  system.  One  of  the  most  widely  used 
wireless  communications  systems  is  the  global  system  for  mobile  (GSM)  communications, 
which  operates  at  890-915  MHz  for  uplink  and  935-960  MHz  for  downlink.  The  new  genemtion 
of  personal  communication  systems,  such  as  digital  communication  systems  (DCS)  1800, 
operates  at  1.710-1.785  GHz  for  uplink  and  1.805-1.880  GHz  for  downlink.  Therefore,  antennas 
for  current  personal  communication  systems  are  required  to  operate  at  the  frequency  range  form 


0.9  GHz  to  2.0  GHz.  If  the  future  use 
of  higher  frequency  PCS  and  the 
possibility  of  applying  PCS  for 
military  purposes  are  considered,  then 
antenna  designs  for  wireless  personal 
communication  systems  should  be 
expanded  in  scope  to  coyer  the 
frequency  range  from  the  current  0.9- 
2.0  GHz  to  0.9-2.5  GHz  range.  In  this 
paper,  studies  for  the  characteristics  of 
a  printed  meander  line  antenna  [1-5], 
using  the  finite  difference  time 
domain  (FDTD)  [6]  techniques  with 


Figure  1  A  meander  line  antenna  on  a  small  ground  plane. 
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Berenger’s  perfectly  matched  layers  (PML)  [7]  absorbing  boundaries  are  presented.  The 
presented  designs  of  the  antenna  shown  in  Fig  1,  feature  small  dimensions  (77  x  1 1  x  3.17  mm3), 
approximately  50  D  input  impedance,  dual  frequency  band,  and  operate  within  the  0.9-2.5  GHz 
range  on  a  comparably  small  ground  plane  (60  x  27  mm2).  One  of  the  validations  of  the 
numerical  code  used  in  this  investigation  is  made  by  computing  the  return  loss  of  a  rectangular 
printed  patch  antenna  and  comparing  the  numerical  results  with  the  computed  and  measured  data 
reported  in  [8,  9].  Empirical  design  equations  and  the  analysis  for  the  input  impedance  and 
resonance  frequencies  of  meander  line  antennas  are  also  provided. 

Analysis 

The  antenna  considered  in  this  investigation,  as  shown  in  Figure  1,  is  a  meander  line  trace 
printed  on  a  dielectric  slab  sitting  on  a  perfectly  conducting  ground  plane.  The  parameters  el  and 
e2  represent  the  lengths  of  the  vertical  and  horizontal  printed  traces,  respectively.  In  this  study, 
el  and  e2  are  chosen  to  be  3  mm,  and  the  width  of  the  printed  trace  is  1mm.  The  parameter  is 
the  vertical  length  of  the  meander  line.  The  distance  between  the  edge  of  the  dielectric  slab  and 
the  edge  of  the  ending  segment  of  the  meander  line  is  set  equal  to  el.  The  reason  for  modeling 
the  meander  line  antenna  on  a  small  ground  plane  (60  x  27  mm2)  is  to  simulate  its  performance 
when  this  antenna  is  placed  on  top  of  a  PCS  handset.  The  main  objective  of  this  research  is  to 
design  a  meander  line  antenna  with  50  Q  input  impedance  at  dual  operating  frequencies  within 
0.9  to  2.5  GHz.  Other  important  parameters  are  to  minimize  the  size  of  the  dielectric  slab  and  the 
ground  plane.  The  frequency  range  under  study  is  extended  form  0.9-2.5  GHz  to  0.5  -3.0  GHz 
for  clearer  observations  of  the  resonance  behavior  of  the  antenna. 

The  finite  difference  time  domain  (FDTD)  technique  is  applied  to  model  the  antenna 
inside  a  3  dimensional  air  chamber  terminated  with  the  artificial  boundaries  of  Berenger’s 
perfectly  matched  layers  (PML)  [7].  The  PML  is  adopted  to  reduce  the  numerical  reflection  from 
the  truncated  boundaries  of  the  finite  problem  space,  which  resembles  the  anechoic  chamber  for 
antenna  measurements.  An  array  of  voltage  sources  with  a  Gaussian  waveform  are  placed 
between  the  ground  plane  and  the  edge  of  the  first  vertical  segment  of  the  trace  line  for 
excitations. 

To  begin  the  meander  line  antenna  design,  the  width  (Ws)  and  thickness  (ts)  of  the 
dielectric  slab  are  set  to  3.17  mm  and  11  mm,  respectively.  The  changes  in  impedance  and 
resonant  frequency  are  observed  by  increasing  the  number  of  segments  of  the  meaner  line  trace 
gradually  until  the  design  goal  is  achieved.  The  effects  of  dielectric  constant,  the  width  of  the 
slab,  and  thickness  of  the  slab  are  also  studied.  In  this  simulation,  the  values  of  the  dielectric 
constant  are  2.2,  3.1,  4.1,  7.1,  and  10.2,  while  the  thicknesses  of  dielectric  slab  are  0.794,1.588, 
3.175, 4.763,  and  6.35  mm.  These  values  represent  commonly  used  materials  for  radio  frequency 
(RF)  and  microwave  circuit  boards.  The  width  of  the  dielectric  slab  is  varied  from  1 1  to  41  mm 
which  is  an  appropriate  width  to  be  incorporated  with  a  PCS  handset.  From  these  numerical 
experiments,  empirical  equations  are  derived  to  facilitate  the  design  procedure. 

Numerical  Results 

The  computation  of  the  return  loss  of  a  rectangular  patch  antenna  presented  in  [8]  as 
shown  in  Figure  2  is  reported  here  as  a  validation  for  the  developed  code.  The  FDTD  cell 
discretization  and  the  dimensions  of  this  patch  antenna  can  be  found  in  [8].  In  Figure  2,  good 
agreement  between  the  computed  numerical  results  and  the  published  computed  and  measured 
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Figure  2  The  Return  loss  of  a  rectangular  patch  antenna. 
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Figure  3  The  input  impedance  of  a  meander  line  antenna  versus 
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data  are  observed.  The  necessary  number  of  layers  of  air  buffer  between  the  patch  antenna  and 
the  PML  is  also  illustrated  in  this  example.  Reference  [9]  presents  results  obtained  by  placing  10 
cells  of  PML  on  top  of  5  cells  of  air  above  the  patch  and  3  cells  of  air  away  from  the  edge  of  the 
patch.  In  this  paper,  the  number  of  PML  cells  used  in  all  numerical  cases  is  8.  In  Figure  2, 
negligible  difference  is  observed  due  to  the  placement  of  the  PML  cells  2  and  5  cells  away  from 
the  top  surface  of  the  patch.  Therefore,  the  PML  can  be  placed  very  close  to  the  antenna  under 
study  as  concluded  in  [7  and  9],  which  allows  more  computational  resources  for  the  problem 


space. 

The  input  impedance  of  the  initial  design  (ts  =  3.17  mm  and  Ws  =  1 1  mm)  of  the  meander 
line  antenna  for  various  numbers  of  segments  are  plotted  in  Figure  3.  As  shown  in  the  Figure,  the 
increase  of  Lax  decreases  all  the  resonant  frequencies  but  increases  the  resonance  impedance. 
Additionally,  an  optimal  case  was  found  when  using  a  length  of  48  segments  (73  mm  for  Lax\ 
where  the  input  impedance  at  the  first  and  third  resonance  frequency  is  48.5  Q  and  44  fi, 
respectively.  This  design  shows  that  the  antenna  can  operate  at  both  0.95  and  2.39  GHz  and  can 
adequately  be  matched  to  the  traditional  50  Q  impedance  of  the  front-end  circuitry.  This  50  Q 
impedance  characteristic  should  simplify  the  system  components  by  removing  the  impedance 
matching  network  between  the  feed  and  the  antenna.  The  return  loss  using  different  lengths  of 
Lax  are  plotted  in  Figure  4,  which  shows  that  the  bandwidths  of  this  case  are  8  %  and  4  %, 
respectively.  The  empirical  equations  for  the  first  resonance  frequency  and  impedance  for 
various  values  of  Lax  are  derived  from  regression  curve-fit  of  the  data  plotted  in  Figure  5  and  6, 
respectively.  In  Figure  5,  the  regression  curve  is  also  compared  to  the  first  resonant  frequency 
computed  by  the  empirical  equation  presented  in  [1,  3,  5],  which  define  the  length  shortening 
ratio  as 


SR  = 


X/l-lLn  ’ 
A/2 


(1) 


Reference  [5]  suggests  using  a  shortening  ratio  between  33%  and  35%  for  computing  the 
first  resonant  frequency  of  meander  line  antennas.  From  Figure  5,  the  resonant  frequency 
computed  by  equation  (1)  departs  from  the  currently  presented  regression  curve  as  Lax  increases. 
The  empirical  equations  for  the  first  resonant  frequency  of  meander  line  antennas  using  1  mm 
trace,  el  =  e3  =  3mm,  ts  =  3.17  mm  and  Ws  =  1 1  mm,  on  a  60  x  27  mm2  ground  plane  is 

fo  ~  22.69Z^'7392  (2) 


and  whose  first  resonant  impedance  as  shown  in  Figure  6,  can  be  modeled  by 

Z„=  22.94 le°'0103i“  (3) 

These  new  equations  (1)  and  (2)  can  not  only  be  used  to  solve  for  possible  L^  when  a  desired 
impedance  is  specified,  but  also  to  predict  the  first  resonant  frequency  and  impedance  for  a 
chosen  Lax- 

In  addition  to  changing  Lax,  different  values  of  the  substrate  dielectric  constant  are 
adopted  to  examine  their  influence  on  both  resonant  frequency  and  impedance  in  the  optimal 
case.  From  Figures  7  and  8,  both  resonant  frequency  and  impedance  decrease  with  the  increase 
of  the  dielectric  constant.  The  effects  on  resonant  frequenies  are  not  as  strong  as  that  on  the 
impedance. 
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Figure  4  The  return  loss  of  a  meander  line 
antenna  versus  L„. 


Figure  5  The  first  resonant  frequency  of  a 
meander  line  antenna  versus  Lu. 


Figure  6  The  first  resonant  impedance  of  a 
meander  line  antenna  versus  L„. 


23456789  10  11 


E  r 

Figure  7  The  first  and  third  resonant 
frequencies  of  a  meander  line  antenna 
versus  the  substrate  dielectric  constant 
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Figure  8  The  first  and  third  resonant 
impedance  of  a  meander  line  antenna 
versus  the  substrate  dielectric  constant. 
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Figure  9  The  first  and  third  resonant 
frequencies  of  a  meander  line  antenna 
versus  the  substrate  width. 
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Figure  10  The  first  and  third  resonant 
impedance  of  a  meander  line  antenna 
versus  the  substrate  width. 


Figure  11  Dual  band  50  £2  meander  line  antennas. 


The  effects  of  width  of  the  dielectric  slab  are  also  investigated.  From  Figure  9  the 
variations  of  the  first  and  third  resonant  frequencies  due  to  the  change  of  slab  width  are 
insignificant.  However,  the  effects  on  the  resonant  impedance  are  clearly  noticable.  From  Figure 
10  the  increase  of  the  slab  width  casued  the  reduction  of  the  resonant  impedance  Furthermore 
reducing  the  slab  width  from  1 1  mm  to  7  mm  results  in  another  useful  case  with  48.53  and  46.66 
Q  for  the  first  and  third  resonances  as  shown  in  Figure  10.  Transfenng  this  impedance 
information  into  return  loss,  results  in  a  return  loss  of  35.2  dB  with  8%  bandwith  and  28  dB  with 
3%  bandwidth  for  the  first  and  third  resonances,  respectively  as  shown  in  Figure  11.  In i  this 
design  the  slab  width  is  only  4  mm  more  than  the  width  of  the  meander  line  trace.  From  these 
results,  an  appropriate  method  for  fine-tuning  the  resonant  impedance  can  be  made  by  adjusting 
the  slab  width,  which  is  also  feasible  in  antenna  realizations.  . 

Finally,  the  effect  of  the  dielectric  slab  thickness  is  also  investigated.  It  is  found  that  toe 
thickness  of  the  dielectric  slab  affects  the  resonant  frequencies  within  0.02  GHz  for  the  first 
resonace  and  0.1  GHz  for  the  third  resonace  as  shown  in  Figure  12.  However,  the  resonant 
impedance  of  the  first  and  third  modes  is  changed  over  a  range  of  10  Q  as  shown  in  F^ure  13- 
Therefore,  the  variation  in  slab  thickness  does  not  apear  to  be  an  effictive  procedure  to  fine-tune 
the  meander  line  antenna. 


Conclusions 


A  detailed  investigation  for  optimizing  the  meander  line  antenna  for  personal  wireless 
communication  has  been  presented.  Empirical  equations  are  obtained  and  used  to  achieve  a 
design  in  which  an  external  matching  network  is  not  necessary  An  additional  fine  tumng  for  the 
resonant  frequency  and  impdeance  can  be  achieved  by  Varying  the  slab  width.  Future  studies  will 
be  focused  on  the  optimization  of  the  radiation  pattern  parameters. 
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Figure  12  The  first  and  third  resonant 
frequencies  versus  the  substrate  thickness. 
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Figure  13  The  first  and  third  resonant  impedance 
versus  the  substrate  thickness. 
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Abstract:  A  simple  deconvolution  algorithm,  based  on  a  fourth-order  moment-expansion,  is 
presented  to  compute  the  impulse  response  of  FDTD-modelled  microwave  devices.  The 
algorithm  is  useful  to  characterize  a  structure  by  its  impulse  response  matrix  as  needed  by 
segmentation  techniques,  such  as  diakoptics,  and  to  integrate  data  from  electromagnetic  field 
analysis  into  circuit  simulations.  The  algorithm  has  been  tested  on  microwave  planar  devices 
with  satisfactory  accuracy. 


I.  Introduction 

The  Finite  Difference  Time  Domain  method  [1]  proved  to  be  an  efficient  and  accurate  tool  for 
the  modeling  of  microwave  devices  such  as  guiding  and  radiating  structures.  Moreover,  by  local 
modification  of  the  standard  algorithm,  lumped  linear  and  non-linear  elements  can  be  easily 
included  in  a  model.  However,  finite  computer  resources  limit  the  application  of  the  algorithm  to 
the  analysis  and  optimization  of  small  structures.  Furthermore,  instabilities  and  accuracy  loss  are 
experienced  for  large  domains.  Recently  the  method  has  been  extended  to  larger  systems  by  taking 
advantage  of  segmentation  techniques  such  as  the  time-domain  diakoptics  [2,3],  In  the  last  method 
a  large  structure  is  preliminary  segmented  into  smaller  modules,  each  one  being  simulated 
individually  and  then  characterized  with  its  impulse-response  matrix.  The  whole  structure  is 
therefore  analyzed  as  a  module  interconnection  via  interfaces  performing  convolutions  between 
individual  responses.  Therefore,  the  accurate  computation  of  the  impulse  response  of  a  microwave 
device  can  play  a  very  important  role  in  the  analysis  of  a  composite  structure.  Moreover  the 
knowledge  of  the  impulse  response  allows  the  deembedding  of  equivalent  lumped  circuits  [4,61  and 
therefore  can  be  considered  as  a  gate  towards  the  integration  into  circuit  simulations  of  data 

obtained  from  electromagnetic  field  analysis.  ___  . 

In  this  contribution  the  problem  of  the  impulse-response  computation  by  FDTD  is 
investigated.  Emphasis  will  be  devoted  to  the  selection  of  the  input  waveform  with  respect  to  FDTD 
accuracy  and  bandwidth.  A  simple  deconvolution  algorithm,  based  on  a  moment  expansion,  will  be 
presented.  This  method  performs  a  pure  time-domain  processing  of  the  device  tune-response  for 
suitable  FDTD  input  waveforms  while,  ill-conditioned  inverse  Fourier  transforms  are  avoide  . 
Examples  are  given  for  a  patch  antenna  and  a  planar  low-pass  filter. 
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II.  Statement  of  the  problem 

Consider  an  iV-port  microwave  device.  At  each  input  port  (Fig.  1),  which  can  be  either  a 
cable/microstrip  transition  or  a  waveguide  connection,  an  input  variable  is  an  impressed  voltage 
generator  (voltage  port)  or  an  impressed  current  generator  (current  port)  or  an  inward  modal  field 
distribution  (modal  port).  At  each  output  port,  variables  are  physical  currents  or  voltages,  or 
outgoing  mode  amplitudes  when  the  port  is  closed  on  a  matched  load. 


Fig.  1:  N -port  microwave  linear  passive  device.  Input  and  output  variables 
are  time-domain  signals 

Let  xm{t)  be  a  band-unlimited  (time-dependent)  excitation  at  the  m-th  port  and  yn(t)  the  output 
waveform  at  the  n-th  port.  If  the  device  is  causal,  passive  and  is  governed  by  linear  differential 
equations  with  constant  coefficients,  the  following  convolution  integral  holds: 

(1)  Vn(t)  =  J  xm(r)hmn(t  -  r)dr  =  x*h(t) 

where  hmn(t)  is  the  impulse  response  between  the  m-th  and  n-th  ports.  hm„(t)  can  be  formally 
defined  sis: 

(2) 

where  F~l  is  the  inverse  Fourier  transform,  Xm(w)  =  F[xm(t)]  and  Yn(tv)  =  F[yn(t)).  IfFDTDis 
used,  the  Fourier  transform  of  yn(t)  is  performed  on  data  sampled  at  time  k  At,  where  At  is  the 
FDTD  time-step. 

The  impulse  response  can  be  obtained  directly  in  time  domain  by  solving  an  integral  equation 
(Eq.l)  or  by  frequency-domain  manipulations  and  Fourier  processing  (Eq.2).  In  either  case,  the 
accuracy  of  the  impulse-response  computation  depends  on  both  the  input  signal  waveform  and  the 
FDTD  accuracy.  Numerical  noise  is  mainly  caused  by  numerical  dispersion  [6],  boundary 
staircasing,  approximation  of  numerical  derivatives  and  imperfect  grid  termination. 

Since  the  FDTD  method  performs  a  numerical  finite  approximation  of  an  electromagnetic 
boundary  value  problem,  the  computation  of  Yn(u>)  is  less  accurate  outside  a  frequency  band 
Bm  =  [fmi%rD,  ■  The  highest  frequency  depends  on  spatial  discretization  since,  for  a 

rectangular  grid  with  largest  cell  size  A,  the  shortest  wavelength  which  can  be  accurately  resolved  is 
such  that  \min  >  nA  with  n  =  (8  -f  10)  and  therefore  f^D  —  c/n/A.  A  lower  bound  of  FDTD 
bandwidth  exists  since,  to  accurately  model  slow  field  oscillations,  the  simulation  should  be  carried 
on  for  a  large  number  of  iterations,  but  the  numerical  noise  limits  the  simulation  time-window. 
Though  not  directly  quantifiable,  is  generally  one  or  two  order  of  magnitude  lower  than 

fFDTD 
J  max 
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Therefore,  the  input  waveform  for  FDTD  computation,  must  be  properly  chosen  in  order  to 
extract  the  maximum  information  in  the  band  of  the  numerical  algorithm.  Moreover,  the  errors  in 
Yn(u>)  are  amplified  by  l/Xm{w)  depending  on  frequency,  unless  xm(t)  is  a  Dirac  pulse. 

Generally,  smooth,  narrow  pulses  such  as  the  Gaussian  xa(t),  the  Modulated-Gaussian  xmgOO  or 
the  Deiivated-Gaussian  id c(t)  are  appropriate  FDTD  excitations: 


(3’)  xG(t)  =  e~S%£ 

(3")  i  Mc(t)  =  sin(27r/0<)e“^" 

(3’")  i Dc(t)  =  - 


It  is  possible  to  set  up  the  desired  time-domain  and  frequency-domain  signal  features  by 
selecting  the  parameters  T,  t  and  /0.  The  Gaussian  pulse  (3’),  though  very  short  in  the  time  domain, 
has  a  DC  peak  of  spectral  energy  density.  As  a  consequence,  an  important  amount  of  energy  excites 
those  spectral  components  (of  the  electromagnetic  structure  response)  which  can  not  be  accurately 
predicted  by  the  finite  band-width  of  the  FDTD  method.  On  the  contrary,  the  modulated  Gaussian 
pulse  (3")  is  well  suited  to  perform  high-frequency  analysis  and  the  excitation  band  can  be  properly 
tuned  on  the  FDTD  band-width.  The  drawback  of  this  excitation  is  the  high  number  of  oscillations 
in  the  time  domain  which  often  cause  a  long  settling  time  for  the  FDTD  solution.  An  interesting 
compromise  is  the  derivated  Gaussian  pulse  (3'")  which  has  a  small  low-frequency  energy 
distribution  and  only  two  oscillations. 


HI.  Deconvolution  by  moment  expansion 

Several  stable  algorithms  are  available  [7,8]  to  solve  general  deconvolution  problems  with  both 
discrete  and  continuous  data  without  direct  use  of  Eq.(2).  However,  if  the  input  waveforms  are 
those  commonly  used  with  FDTD  method,  the  impulse  response  can  be  accurately  approximated  by 
a  simple  formula  involving  pure  time  domain  quantities  and  operations.  The  Moment  Expansion 
method  can  be  applied  in  the  form  introduced  by  Papouhs  [9]  for  image  restoration  and  extended  to 
the  specific  application.  Papoulis'  algorithm  works  properly  when  xm(t)  takes  significant  values 
within  an  interval  that  is  small  compared  with  the  variations  of  yn(t).  This  normally  applies  when  a 
Gaussian  or  a  Derivated-Gaussian  waveform  are  used.  The  input-signal  spectrum  can  be  expressed, 
for  any  t0  >  0,  by  means  of  the  following  moment  expansion: 


(4) 


fc= o 


where  the  fc-th  moment  of  x m(t  -  to)  is  computed  as: 

p= o 


(a  «  =  r°o  -  u,?x„wt =£(!)<- 

7-oo  P=0  W' 


A  similar  expansion  holds  for  [Xa(u)e^]‘!  with  moments  {a*}  obtainable  from  0**}-  A  time 
domain  formula  for  hmn{t  -  to)  can  be  obtained  from  Eq.(2)  as: 


(6)  hmn(t  - 10)  =  [(  -  jw)Vn(u;)j 

(=o  *• 
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Finally,  it  easily  shown  that  the  impulse  response  can  be  written  as  the  following  series: 
rt+t»  i  h  i  d3 


(7) 


hmn(t)  =  ' 


°°i  v+“4l+‘*4^  "]^'+ 


l  d 2 


l  d 3 


1  d 4 


a0"C22!^"O33!^  +  a44!^+' 


&n{t  +  to) 


when  xm(t)  has  zero  or  non-zero  mean,  respectively. 

The  above  series  are  rapidly  convergent  [9].  The  numerical  differentiation  of  discrete  data  is 
more  and  more  inaccurate  with  the  increase  of  the  differentiation  order.  Therefore,  only  few  terms 
are  retained  for  reasonable  accuracy.  A  second-order  truncation  of  Eq.(7)  was  suggested  in  [9], 
Instead,  we  found  that  a  fourth-order  truncation  produced  better  results  for  FDTD  analysis  of 
microwave  devices  with  Gaussian  and  Derivated-Gaussian  excitation  in  Eq.(3).  Accordingly,  by 
choosing  to  =  r  the  first  non-zero  moments  are  reported  in  TAB.  I. 


TAB.  I:  Moment  Expansion  coefficients 


Gaussian  pulse 

Derivated-Gaussian  pulse 

1 

v^rv/a 

E9 

-ooT2 

—  aoT2 

a4 

3O0T4 

6oo  T4 

IV.  Numerical  Examples 

Two  numerical  examples  are  reported.  A  multilayer  patch  antenna  and  a  microstrip  low-pass 
filter  were  modeled  by  an  FDTD  code  [10]  and  impulse  responses  were  computed  at  different  ports 
of  the  structures. 


Patch  Antenna  -  A  coaxially-fed  stacked  patch  antenna  (Fig.  2a)  was  FDTD  modeled  by  means  of 
a  non-uniform  grid  with  cell  size  0.9mm  <  Ax,  Ay  <  1.8mm  and  0.3mm  <  Az  <  1mm.  A  simple 
model  of  the  transition  between  coaxial  cable  and  ground  plane  [11]  was  a  real  voltage  generator 
with  a  series  lumped  resistor  J?o  =  50  G  which  is  equivalent  to  the  characteristic  resistance  of  the 
cable.  The  grid  was  terminated  by  6  cell  PML  absorbing  boundary  conditions.  The  time  waveform, 
vq{JL),  of  the  voltage  generator  was  a  Derivated-Gaussian  pulse  with  r  =  65ps  and  half-power 
bandwidth  [1.1GHz,  \QGHz]  which  is  wide  enough  to  predict  the  first  antenna  resonances  for 
standard  working  conditions  (2.4  GHz  to  3.4  GH z).  This  structure  can  be  considered  as  a  one-port 
device.  The  output  variable  is  the  current  i(t)  at  the  source  point.  The  current  i(t)  is  computed  as  the 
magnetic-field  circulation  around  the  excitation  point.  The  impulse  response  h(t)  is  defined  as  the 
inverse  Fourier  transform  of  the  input  admittance: 


(8) 


h(t)  = 


where  Vo(w)  =  F[vo (i)]  and  I(u)  =  P[i{t)}. 
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To  reduce  the  high-frequency  numerical  noise  of  the  FDTD  solution,  i(t)  was  low-pass  filtered 
(15 GHz  cut-off  frequency)  before  numerical  computation  of  second  and  fourth  derivatives  of  i(t)  in 
Eq.  7.  The  computed  impulse  response  evaluation  is  shown  in  Fig.2a.  To  appreciate  the  accuracy  of 
the  method,  the  convolution  v0*h{t)  =  i(t )  was  evaluated  and  compared  with  i{t)  (Fig.2b).  A  good 
agreement  can  be  observed  and  the  relative  root  mean  square  (r.m.s.)  error  6(i,i)  =  y/  f\i{t)  -  i(t)\2dt 
/  J $\i{t)\2dt  was  less  than  7%.  The  difference  in  the  first  transient  is  probably  related  to  the  source 
model  which  causes  unphysical  non-linear  effects. 


Fig  2  a)  Current  impulse  response  of  a  stacked  patch  antenna  (a  —  41.8mm,  b  -  33.2  mm,  dx  —  20.9mm, 
<4  =  5mm,  h  =  1.57mm,  l2  =  5.4mm,  ei  =  2.2,  e2  =  D  at  the  coax/ground  plane  transition;  b)  comparison 
between  the  FDTD  computed  current  and  the  convolution  of  the  derivated  Gaussian  and  the  impulse  response 


obtained  by  numerical  deconvolution. 


Microstrip  filter  -  A  planar  low-pass  filter  [12]  on  a  0.794  mm  substrate  with  2.2  permittivity, 
was  meshed  on  a  non-uniform  grid  with  cell  sizes  0.3  mm  <  Ax,  Ay  ^  1  mm  and 
0.1mm  <  Az  <  0.5mm  .  The  computation  domain  was  truncated  by  6-cell  PML  absorbmg  boundary 
conditions.  The  source  was  modeled  as  a  Huygens-injector  template  [1]  exciting  a  y+ -directed  quasi- 
TEM  mode  (Gaussian  stimulus  with  half-power  bandwidth  [0, 15 GHz]  and  r  =  65 ps).  The  input 
function  was  a  voltage,  Vl(t),  computed  as  the  line  integral  of  the  incident  electric  field  from  the 
ground  plane  to  the  microstrip.  The  output  was  the  voltage,  v2(t),  at  port  2,  defined  ini the  same 
way.  Therefore,  the  impulse  response  h2i(t)  was  defined  as  Jr"1[S'2i]  where  S2iis  the  off-diagonal 

element  of  the  scattering  matrix.  _  ,  . . 

Low-pass  filtering  (20  GHz  cut-off  frequency)  is  applied  to  v2{t)  before  evaluation  of  derivatives. 
The  computed  impulse  response  is  shown  in  Fig.3a.  The  convolution  v2(t)  of  the  impulse  response 
(Fig.3a)  and  t>i(t)  is  plotted  for  comparison  with  t?2(^in  Fig.3b.  The  diagrams  are  indistinguishable 
almost  everywhere  and  the  relative  r.m.s  error  6(v2,v2)  was  less  than  2%. 


V.  Conclusions 

In  this  work  the  time-domain  characterization  of  a  microwave  device  by  its  impulse 
response  has  been  considered.  With  reference  to  the  fall-wave  analysis  of  microwave  devices  by  the 
FDTD  method  and  Gaussian  and  Derivated-Gaussian  input  waveforms,  a  simple  time-domain 
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deconvolution  algorithm  has  been  proposed  to  compute  the  impulse  response  from  FDTD  data.  The 
deconvolution  is  based  on  a  fourth-order  moment  expansion  of  the  input  spectrum.  A  numerical 
analysis  has  been  carried  out  for  two  planar  structures  showing  the  practicability  of  the  method. 


Fig.  3  a)  Planar  low-pass  filter  (size  in  mm)  and  its  impulse  response  A21  vs.  time  by  the  FDTD/Moment 
Expansion  method;  b)  vfFDTD  computed)  and  vzi convolution  of  input  and  impulse  response)  vs.  lime  for  a 
Gaussian  input  V\. 
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Abstract 

The  finite  difference  in  time  domain  (FDTD)  method  has  been  used  to  calculate  the  wave  guiding 
parameters  for  microstrip  components.  The  calculated  characteristic  impedance  and  the  effective 
dielectric  constant  as  a  function  of  frequency  commonly  exhibit  oscillatory  behavior.  The  cause  of  this 
anomaly  is  analytically  and  numerically  shown  to  be  due  to  the  imperfect  absorption  boundary 
condition  (ABC).  An  equivalent  circuit  approach,  by  computing  capacitance  and  inductance  of  a 
microstrip  line,  is  shown  to  be  useful  in  eliminating  the  oscillatory  feature  in  dispersion  relations. 

I.  Introduction 

The  finite  difference  in  time  domain  (FDTD)  method  has  been  used  to  study  the  waveguiding 
properties  of  the  devices  in  a  MMIC  [1-7].  Microstrip  lines  are  the  basic  waveguiding  structure  in  a 
MMIC.  They  are  used  to  transfer  microwave  energy  into  and  out  of  active  devices  in  the  integrated 
circuit  .  For  a  proper  design  of  a  matching  circuit,  one  needs  to  know  two  important  microwave 
parameters  associated  with  microstrip  lines.  One  is  the  characteristic  impedance  and  the  other  is  the 
effective  dielectric  constant.  In  FDTD  analysis,  the  frequency  dependence  (dispersion  relation)  of  these 
microwave  parameters  can  be  calculated  in  a  straightforward  manner.  However,  the  results  commonly 
exhibit  oscillatory  behavior.  This  behavior  is  not  seen  in  the  results  obtained  using  a  full  wave  analysis 
in  the  frequency  domain  and  it  is  unique  to  the  FDTD  analysis.  It  is  important  to  understand  this 
behavior  in  order  to  improve  the  FDTD  analysis  and  thus  achieve  useful  information  for  a  MMIC  circuit 
design. 


Several  reasons  have  been  speculated  for  this  behavior.  They  are  1)  imperfect  absorbing 
boundary  conditions  (ABCs)  that  generate  artificial  reflection  wave  at  the  computational  boundary;  2)  the 
excitation  source  can  contain  mixed  modes  of  propagation  and  evanescent  waves;  3)  the  continuous 
radiation  loss  spectrum  that  was  taken  into  account  in  FDTD,  but  failed  to  be  implemented  properly  in 
the  conventional  method  for  the  phase  velocity  determination;  and  4)  possible  numerical  errors  due  to  the 
use  of  a  numerical  approximation  of  the  fourier  integration.  Among  these,  the  first  is  recognized  to  be 
the  main  cause.  In  this  paper,  we  will  show  analytically  and  numerically  that  this  is  indeed  the  case. 

With  the  FDTD  results,  the  conventional  way  to  calculate  the  phase  velocity  is  to  use  the  so  called 
multiple-z  method  [1-5].  The  multiple-z  method  uses  multiple  observation  locations  in  the  propagation 
direction  to  monitor  the  wave  propagation  at  different  frequencies.  By  comparing  the  phase  for  each 
frequency  component  at  these  locations,  the  propagation  constant  can  be  derived  from  the  knowledge  of 
the  distance  the  wave  travels  between  the  monitoring  locations.  We  have  proposed  an  equivalent  circuit 
approach  to  solve  the  oscillatory  problem[7].  This  approach  needs  only  one  observation  point  in  the 
propagation  direction  to  monitor  the  wave  and  to  calculate  the  dispersion  relation  of  the  phase  velocity;  it 
is,  therefore,  also  called  the  single-z  method.  The  justification  of  the  equivalent  circuit  approach  will 
also  be  reported  in  this  paper. 
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In  the  next  section,  a  theory  is  presented  to  show  the  oscillatory  behavior  in  the  dispersion  curve 
using  FDTD  results.  Section  III  presents  the  numerical  results  and  discussions  and  is  followed  by  the 
conclusion. 

II.  Theory 

In  the  waveguiding  structure,  the  electric  and  magnetic  waves  propagate  along  the  z-direction. 
The  electric  and  magnetic  fields  as  functions  of  x,  y,  z  and  t  are  given  by: 

E(x,y,z,t)  =  £  JJ dpdo)Em (x, y, /?,  o))eilp:-^ <MS(w  -  wj 

H(x,y,z,t)  =  X  JJ  dpdcoHm(x,  y,  (o)ei[tk~m'°  {m8{w  -  wm) 


where  m  stands  for  the  mode  number.  Assuming  only  the  fundamental  modes  are  propagating,  and  the 
validity  of  linear  circuit  theory  (i.e.,  TEM  or  TEM-like  modes),  the  voltage  and  current  waves  can  be 
given  by: 

V(z,t)  =  J  V0(6>)e~°Ze-ia*d63 


l(z,t)  =  J I0(6))e‘v°  e~icotdco 

where  \)0  is  the  phase  velocity  of  the  fundamental  mode;  V0  is  the  line  integral  of  the  electric  field  of  the 
fundamental  mode  between  the  microstrip  line  and  the  ground  plane  and  I0  is  the  line  integral  of  the 
magnetic  field  of  the  fundamental  mode  around  the  microstrip  metal.  The  fourier  components  of  V(z0,t) 
and  I(z0,t)  at  an  angular  frequency  of  CD  are  given  by: 

V(Zo,6))=V0((o)e^ZO 

.6^ 

I(Zo,(i))  -  I0(a>)eVo^ 

The  time  series  obtained  using  FDTD  contain  artificial  noise  that  is  generated  at  the  computation 
boundaries.  The  most  significant  component  is  from  the  end  boundary  of  the  propagation  path,  ie., 
z=zL.  This  noise  is  superimposed  on  V(z0,co)  and  I(z0,CD).  They  are  now  given  by: 

V(zo,Q) )  =  V0(co)e  °  +  Vn(co)e  Vo< 

^0  i  ~{ 2  Zf_  — Zq  ) 

/(Zo.ffl)  =  We  Vo  +/„(G>)eVo 


where  Vn  and  In  are  the  spectral  noise  (including  possible  phase  reversal)  generated  at  the  computational 
boundary.  One  also  notes  that  the  noise  term  has  a  different  phase  factor  due  to  a  longer  propagation 
length.  The  characteristic  impedance  is  given  by: 


Z0(Zo  •©)  = 


ViZpM 
I(z0,G) ) 


=  Z,« »)[l+<£ 
’'o 
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where  Z0(z0,co)  is  the  position-dependent  characteristic  impedance  and  exhibits  oscillatory  behavior  as  a 
function  of  frequency,  while  Z0((o)  is  the  true  characteristic  impedance  and  is  independent  of  the  monitor 
location,  and 

Atf  =  —  (zL-z0) 

v0 

The  cycle  of  this  oscillation  is  about  u0/(2(zL-  z0).  One  also  notes  that  the  characteristic  impedance  at  a 
low  frequency  (g>h>0)  has  a  maximum  deviation  from  the  true  value. 

The  oscillatory  phenomenon  in  the  frequency  dependent  phase  velocity  can  be  analyzed  similarly. 
Following  the  conventional  multiple-z  method,  the  time  series  of  voltage  wave  are  observed  at  z=  z,  and 
Zj.  The  ratio  of  the  frequency  components  of  both  series  are  given  by: 

Vi^a)  =  e'T0u'--z>)[l+K(eiA^  _ g** 

V(Zpfl»  K 

where  Afl,  =  2(0(zL  -  z ,)/  D0,  and  Afr2  =  2<o(zL  -  z2)/  D0.  The  noise  term  has  both  the  magnitude  and  the 
phase  dependent  on  z„  z2,  Zl  and  frequency,  the  noise  term  adds  a  small  quantity  in  the  phase 
difference  at  the  two  observation  points  which  are  needed  to  determine  the  phase  velocity.  This  small 
quantity  causes  the  oscillation  in  the  frequency  dependence  of  the  phase  velocity.  If  the  distance  between 
the  two  observation  points  is  small  compared  to  the  distance  between  the  observation  points  and  the 

boundary,  then  the  period  in  frequency  is  about  V{2[zL-  (z,+z2)/2]}.  With  some  formulations,  it  can  be 
shown  that  as  ©—>0, 

v0'  =  v0(l  - 

Vo 

where  d0’  is  the  estimated  phase  velocity.  Again,  the  phase  velocity,  at  the  low  frequency,  suffers  the 
most  deviation  from  the  true  value,  as  it  does  for  the  characteristic  impedance  case. 

In  the  equivalent  circuit  approach,  we  calculate  the  equivalent  circuit  components  of  the 
inductance  per  unit  length,  L,  and  the  capacitance  per  unit  length,  C.  For  detailed  information  on  that 
approach  refer  to  Reference  7.  In  the  following,  we  shall  show  that  this  approach  is  immune  to  the 
oscillatory  anomaly. 

The  magnetic  flux  per  unit  length,  \|/,  and  the  charge  per  unit  length,  Q  are  given  by: 

«— s0  i-O-zL-i0) 

yr(z0,CQ)  =  y0((D)e  v°  +  y/n(Q))e  r° 


#— So  i— ( 2;L-zt) 

eUo><»)  =  £>(<*»*  v°  +£.(“>)*  v° 
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where  \j/0  and  \j/n  are  the  frequency  components  of  the  magnetic  flux  of  the  fundamental  mode  and  the 
noise  generated  at  the  boundary.  Qq  and  Qn  are  the  frequency  components  of  the  charge  of  the 
fundamental  mode  and  the  noise  generated  at  the  boundary.  L  and  C  are  then  given  by: 


L(z0,(O) 
C(Zo,fi) ) 


I(Zo,cd)  y  o  70 

=  [1  +  (f-  he™] 

V(zQ,co )  &  V0 


The  characteristic  impedance  and  the  phase  velocity  are  calculated  by: 


z0(z,,m)= 

v„(z0,co)  =  [i =  v0(<a){l +[(t  -  f) + (^  - 

Yo  •'o  tfc  '/o 


Since  the  noise  components  in  both  the  magnetic  flux  and  the  current  are  determined  by  noise 
components  in  the  magnetic  field,  the  ratio  terms  in  the  inner  parenthesis  cancel  each  other.  A  similar 
situation  applies  to  the  noise  terms  for  the  charge  and  the  voltage.  Therefore,  the  noise  terms  have  a  very 
small  effect  on  the  characteristic  impedance  and  the  phase  velocity  in  the  equivalent  circuit  approach.  As 
we  shall  see  in  the  numerical  verification,  this  is  indeed  the  case. 


III.  Results  and  Discussion 


In  this  study,  the  FDTD  method  is  implemented  with  a  dispersive  boundary  condition[5].  A 
simple  source  excitation  scheme[6]  is  used  in  the  present  study.  The  microstrip  line  is  on  GaAs 

substrate  (the  dielectric  constant  £  is  12.9)  with  a  thickness  of  1000  |im.  The  width  of  the  microstrip  line 


is  also  1000  pm.  The  computational  domain  contains  Nx  x  Ny  x  Nz  =  51  x  104  x  120  and  the  spatial 


interval  is  100  pm.  The  source  plane  is  located  at  z  =  20  units.  Fig.  1  shows  the  schematics  of  the 
microstrip  line.  Depending  on  the  parameters  used  in  the  dispersive  boundary  condition,  the  noise 
generated  at  the  computational  boundary  can  be  vastly  different.  Figs.  2  and  3  show  the  electric  filed  as 
functions  of  time  at  three  observation  locations.  The  parameters  used  in  Fig.  2  corresponds  to  a  good 
ABC  with  a  small  noise,  while  the  ones  used  in  Fig.  3  corresponds  to  a  fair  ABC  with  a  large  noise, 
w 


Fig.  1 :  Schematics  of  the  microstrip  line 
with  linewidth  of  W  and  thickness  of  h. 
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(a)  (*» 

Fie  2-  (a)  The  time  series  of  the  electric  field  at  three  different  z  with  a  good  ABC 
implemented,  (b)  the  magnified  time  series  of  electric  field  showing  the  noise  infested 
region. 


Fig  3‘  (a)  The  time  series  of  the  electric  field  at  three  different  z  with  a  fair  ABC 
implemented,  (b)  the  magnified  time  series  of  electric  field  showing  the  noise  infested 
region. 


Using  the  fair  ABC,  Fig.  4  shows  the  characteristic  impedance  as  a  function  of  frequency 
between  0  and  60  GHz.  The  results  of  three  cases  of  different  zL-  z0  values  are  shown.  The  results 
using  the  equivalent  circuit  approach  are  also  shown.  The  period  of  the  oscillation  increases  as  (zL-  z0) 
decreases.  The  periods  are  close  to  the  estimated  values  which  are  12.5,  16.7  and  25  GHz  tor  (zL-  z0)  - 
40  30  and  20  units,  respectively.  The  deviation  from  the  true  value  at  the  low  frequency  is  the 
maximum  as  expected.  The  equivalent  circuit  approach  does  not  show  the  oscillatory  behavior. 
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Fig.  4:  The  characteristic  impedance  as  a  Fig.  5:  The  effective  dielectric  constant 

function  of  frequency.  The  ABC  is  the  as  a  function  of  frequency.  The  ABC  is 

same  as  in  Fig.  2  the  same  as  in  Fig.  2. 


Using  the  fair  ABC,  Fig.  5  shows  the  effective  dielectric  constant  as  a  function  of  frequency 
between  0  and  60  GHz.  The  results  of  three  cases  of  different  zL-  z0  values  are  shown.  The  results 
using  the  equivalent  circuit  approach  are  also  shown.  The  multiple-z  method  shows  oscillatory 
behavior.  The  period  increases  as  z^-  (z0+z2)/2  increases.  The  deviation  at  the  low  frequency  is 
maximum  as  expected.  The  equivalent  circuit  approach  does  not  show  the  oscillatory  behavior. 


To  summarize  this  study,  there  are  several  ways  to  reduce  the  effect  of  artificial  noise  on  the 
microwave  parameters.  The  first  is  to  implement  a  good  ABC  to  reduce  the  generated  noise  at  the 
boundary.  Fig.  6  and  7  show  the  results  using  a  good  ABC.  The  oscillatory  anomaly  is  minimized. 
The  second  is  to  set  up  the  model  in  a  way  that  there  is  a  long  distance  between  the  observation  point  and 
the  boundary.  This  long  distance  reduces  the  influence  of  the  noise.  These  two  method  can  require  a 
long  computation  time.  The  third  is  to  implement  the  signal  processing  techniques  [4]  which  eliminate 
the  effects  of  the  artificial  noise.  Lastly,  the  equivalent  circuit  approach  has  been  shown  to  be  effective 
in  minimizing  the  effect  of  the  noise  due  to  an  imperfect  ABC. 


IV.  Conclusion 


For  microstrip  line  component  analysis  ,  the  traditional  FDTD  method  to  determine  phase 
velocity  is  based  on  multiple-z  values.  The  traditional  method  to  determine  characteristic  impedance  is  to 
compute  the  ratio  between  the  voltage  and  current  components.  The  results,  usually,  show  an  oscillatory 
feature  in  the  dispersion  relations.  This  phenomenon  was  analyzed.  It  was  shown  to  be  caused  by  the 
noise  generated  at  the  boundary  due  to  an  imperfect  ABC.  The  equivalent  approach  using  FDTD  results 
evaluates  the  capacitance  and  the  inductance  at  a  single  monitor  location  in  the  direction  of  propagation. 
The  dispersion  relation  of  the  phase  velocity  and  the  characteristic  impedance  show  little  oscillatory 
anomaly.  This  is  because  the  noise  terms  are  canceled  from  each  other  in  the  capacitance  and  inductance 
evaluation. 
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Fig.  6:  The  characteristic  impedance  as  a 
function  of  frequency.  The  ABC  is  the 
same  as  in  Fig.  3. 


o  10  20  30  40  50  60 

Frequency  (GHz) 

Fig.  7:  The  effective  dielectric  constant 
as  a  function  of  frequency.  The  ABC  is 
the  same  as  in  Fig.  3. 
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Abstract 

The  overall  objective  of  this  study  is  examine  an  implicit  characteristic-based  approach  for  numerical  so¬ 
lution  of  Maxwell's  time-dependent  curl  equations  in  flux  conservative  form.  The  present  method  combines  a 
characteristic  based  finite  difference  spatial  approximation  with  an  implicit  lower-upper  approximate  factorization 
(LU/AF)  time  integration  scheme.  This  approach  is  advantageous  for  three-dimensional  applications  because 
the  characteristic  differencing  enables  a  two-factor  approximate  factorization  that  retains  its  unconditional  stabil¬ 
ity  in  three  space  dimensions,  and  it  does  not  require  solution  of  tridiagonai  systems.  Results  are  given  for  a 
a  one-dimensional  model  problem  involving  propagation  and  scattering  for  free  space  and  dielectric  materials 
using  both  uniform  and  nonuniform  grids.  The  model  problem  is  designed  to  provide  quantitative  insight  into 
both  accuracy  and  efficiency.  The  explicit  FDTD  algorithm  is  used  as  a  convenient  reference  algorithm  for  com¬ 
parison.  The  one-dimensionai  results  indicate  that  for  low  frequency  problems  on  a  highly  resolved  uniform  or 
nonuniform  grid,  this  LU/AF  algorithm  can  produce  accurate  solutions  at  Courant  numbers  significantly  greater 
than  one,  with  a  corresponding  improvement  in  efficiency  for  simulating  a  given  period  of  time. 


1  Introduction 

1.1  Background 

In  the  regime  of  finite-difference  solutions  for  Maxwell’s  time  dependent  curl  equations,  explicit  methods  such 
as  the  Finite-Difference  Time-Domain  (FDTD)  method  [1  ]— [4],  the  Transmission  Line  Method  (TLM),  or  (more 
recently)  Finite- Volume  Time-Domain  (FVTD)  methods  have  been  standard  techniques  for  the  past  20-30  years. 
These  methods  are  relatively  simple  and  efficient,  and  have  proven  very  robust  and  adequate  for  many  types  of 
problems. 

Since  these  explicit  methods  are  conditionally  stable,  their  maximum  time  step  is  limited  by  a  stability  con¬ 
straint  that  depends  on  the  focal  grid  spacing.  The  conditional  stability  restriction  can  increase  the  cost  of 
analyzing  electromagnetic  (EM)  problems,  especially  when  nonuniform  grids  are  needed,  because  a  small  local 
grid  structure  can  require  a  time  step  smaller  than  would  otherwise  be  required  to  accurately  resolve  the  physical 
wave  propagation.  On  the  other  hand,  implicit  algorithms  can  provide  unconditional  stability  which  allows  the 
time  step  to  be  selected  for  accuracy  and  resolution  of  the  frequency  content  of  the  excitation  sources,  without  a 
stability  restriction. 

Implicit  algorithms  were  first  proposed  for  electromagnetics  by  Holland  [5],  They  have  not  received  widespread 
use  in  electromagnetics,  however,  perhaps  due  to  increased  complexity  and  in  some  cases  the  need  to  solve  a 
linear  system  of  equations.  For  example,  centered  spatial  difference  approximations  in  conjunction  with  alternat¬ 
ing  direction  implicit  (ADI)  schemes  require  the  solution  of  (for  example)  tridiagonal  systems  for  each  coordinate 

'This  work  was  sponsored  in  part  by  the  Southeastern  Center  for  Electrical  Engineering  Education  under  Grant  #  SCEEE-97-004  and  by 
NSF  under  Grant  #  EEC-8907070 
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direction.  Furthermore,  although  this  type  of  scheme  is  unconditionally  stable  for  the  first-order  wave  equation 
in  two  dimensions,  it  is  known  to  be  unconditionally  unstable  in  three  dimensions  [6],  The  usual  Fourier  stability 
analysis  does  not  account  for  nonperiodic  boundary  conditions,  however,  and  it  has  been  shown  using  a  matrix 
stability  analysis  that  this  three  dimensional  centered  difference  algorithm  is  conditionally  stable  when  used  with 
upwind  boundary  conditions  [7]. 

There  has  been  recent  work  in  developing  characteristic  based  explicit  and  implicit  algorithms  for  CEM  that 
are  related  to  some  solution  algorithms  from  computational  fluid  dynamics.  These  methods  have  been  developed 
for  curvilinear  three  dimensional  grids,  using  both  finite  difference  and  finite  volume  approximations.  Shankar 
et  al  [8]  [9]  developed  explicit  characteristic-based  finite-volume  methods,  using  both  flux-vector  and  flux- 
difference  splittings  for  electromagnetics.  Shang,  et.  al.  [10]-[1 7]  have  developed  both  implicit  anc I  explicit 
characteristic  based  methods.  Shang  [10HH]  developed  an  implicit  characteristic  based  ADI  method  that 
leads  to  simple  bidiagonal  systems  that  are  easily  solved,  rather  than  tridiagonai  or  pentadiagonal  systems. 
This  characteristic-based  scheme  is  unconditionally  stable  in  two  dimensions  and  conditionally  stable  in  three 
dimensions  [11].  Shang  and  Fithen  [16]  also  developed  characteristic  based  finite  difference  and  finite  volume 
schemes  using  Runge-Kutta  time  integration,  in  curvilinear  coordinates.  Gaitonde  and  Shang  [18]  and  Turkel 
[4]  have  recently  explored  high  order  accurate  compact  differencing  schemes  that  require  tridiagonal  solutions 
for  spatial  approximations,  whether  implicit  or  explicit  time  integration  is  used.  Turkel  [4]  has  suggested  a  two 
dimensional  implicit  version  based  on  an  ADI  scheme. 


1.2  Present  Study 

A  characteristic-based  approach  to  spatial  differencing  has  advantages  in  constructing  implicit  algorithms,  as 
well  as  in  utilizing  the  natural  method  of  incorporating  boundary  conditions  for  hyperbolic  systems.  The  ability  of 
characteristic  based  methods  to  provide  accurate  nonreflecting  solutions  at  outer  boundaries  is  well  known  and 
is  discussed  for  electromagnetics  by  Shang  [1 0] — (1 1  ]  and  others. 

The  present  method  combines  a  characteristic  based  approach  to  spatial  differencing  with  an  implicit  lower- 
upper  approximate  factorization  (LU/AF)  time  integration  scheme  that  can  be  developed  as  a  two-factor  scheme 
that  retains  its  unconditional  stability  in  three  space  dimensions,  unlike  three-factor  ADI  schemes.  The  LU/AF 
scheme  also  avoids  the  need  to  solve  tridiagonal  implicit  systems.  A  two-factor  approximate  factorization  for 
three  dimensions  was  first  suggested  by  Jameson  and  Turkel  [1 9],  This  type  of  factorization  (two  implicit  passes 
in  three  dimensions)  has  been  combined  with  characteristic  based  differencing  by  Beik  and  Whitfield  [20].  This 
general  approach  has  been  further  developed  for  CFD  and  is  discussed,  for  example,  in  [21]. 

Complex  practical  applications  often  involve  nonperiodic  solutions  in  finite  regions  with  complex  geometry  and 
nonuniform  grids.  Grid  nonuniformity  is  an  important  consideration  because  conditional  stability  critena  typically 
link  the  maximum  permissible  time  step  to  the  local  grid  spacing  rather  than  to  the  underlying  physical  time  and 
space  "sampling”  requirements  of  the  solution.  Stable  implicit  methods  can  benefit  from  choosing  the  time  step 
to  satisfy  an  accuracy  requirement  rather  than  a  stability  constraint,  with  the  consequence  that  fewer  time  steps 
are  needed  for  a  fixed  simulation  time.  Accordingly,  a  one-dimensional  model  problem  with  nonunrform  grid  is 
developed  to  explore  the  influence  of  grid  nonuniformity  on  solution  accuracy  and  efficiency.  The  explicit  FDTD 
algorithm  is  used  as  a  convenient  reference  algorithm  for  comparison. 


2  One-Dimensional  LU/AF  Algorithm 

This  section  details  the  theoretical  background  for  development  of  a  second-order  accurate  LU/AF  algorithm. 
This  algorithm  is  used  in  all  comparisons  with  the  FDTD  method,  and  a  first  order  algorithm  is  used  at  points 
adjacent  to  the  outer  boundaries  of  the  computational  grid. 
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Maxwell’s  equations  for  linear,  homogeneous  and  lossless  media  in  the  one-dimensional  case  (taking  d/dy  = 
d / dz  =  0)  are 

dieE^)_dHy_()  9K)  dEz  , 
dt  dx  ’  dt  dx  {) 

These  equations  can  be  rewritten  in  flux  conservative  form  as 

«i  +  2Z  =  0  (2) 

dt  +  dx  [) 

where  q  =  [e£. ,  nHy]T,  /  =  [ -Hv  ,  -E~]T  and  T  denotes  transpose.  To  develop  the  O  (A*2,  Aar)  LU/AF 
algorithm,  the  flux  conservative  form  of  (??)  is  rewritten  to  include  the  electric  and  magnetic  conduction  currents 

» 


0  er'lfi 


where  A  and  P  are  given  by 

M-v.  "o'l'  Mt  A,]  » 

The  time  derivative  in  (3)  is  approximated  by  a  ,/3- weighted,  OlAt2)  difference  equation  given  by 
£i  ~  (2/3  +  1  )g?+1  -  W  +  (2/3  -  l)*?"-1 

dt  ~  2Ai  (5) 

The  spatial  derivative  is  replaced  by  a  /3-weighted  average  between  time  level  n  + 1  and  n.  This  can  be  rewritten 
using  an  operator  notation  as 

ff*/? (A 2ilt  +  A ij-)n+l  +  (1  - 0)  +  A tJ~)n  (6) 

The  A.£  operators  are  0(A*2)  difference  operators  to  be  defined  shortly.  The  parameter  0  can  be  used  to 
construct  a  series  of  explicit  and  implicit  schemes.  For  example,  ft  0  =  0,  this  results  in  a  leapfrog  scheme; 
0  =  0.5  results  in  a  Crank-Nicolson  scheme  and  0  =  1  results  in  an  Euler  implicit  scheme.  Using  (5)  and  (6),  the 
finite-difference  equation  for  (3)  is 

W  +  +  {A-jt  +  Ay;-)-1  +  (i  -  m  (a  jyf  +  A+fi-)"  = 

-0Pq?+1-(l-0)Pq?  (7) 

Using  a  flux  vector  splitting  approach,  this  can  be  rearranged  as 

[I/(2At)  +  09P  +  09  (A^i+(-)  +  A+I-(.))]  Aq?  =  -/&  (8) 

The  residual,  R^,  is  defined  by 

R?,  =  6  +  A nit*  +  A A$T 1 }  (9) 

where  9  =  1/(20  +  1).  The  difference  operators  in  equation  (9)  are  replaced  by  0( Ax2)  backward  and  forward 
upwind  difference  operators  on  three-point  one-sided  stencifs  defined  by 

A £(•)  =  (3(-)i  -  +  (•) i-2)  /(2Ax),  AJ(-)  =  (-3(-)»  +  4(-)i+1  -  (.)f+2)  /(2Ax)  (10) 

Substituting  (/*)”  =  A±qf  into  (9),  the  residual  is  given  by 

=  *  {ptf  +  A +  A  +A-#  +  ^f^Agr1  J  (11) 

Equation  (8)  is  an  0(At2,  Ax2),  unfactored,  upwind  scheme  for  electromagnetics.  The  LU/AF  scheme  is  defined 
by  factoring  the  left  side  of  (8)  into  two  operators,  each  designed  for  a  forward  and  backward  grid  sweep  as  in 
the  first  order  implementation.  The  LU/AF  scheme  is  then  given  by 

[I/(2At)  +  09P  +  09A^A+(-)]Aq-  =  -/%  (12) 

[1/(2 At)  +  09 P  +  09A$iA-(-)}  A#  =  [//(2At)  +  09P]  Aq-  (13) 
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3  Fourier  Analysis 

A  Fourier  analysis  shows  that  the  second-order  upwind  LU/AF  algorithm  is  unconditionally  stable  for  0  >  1/2, 
and  that  it  contains  both  numerical  dissipation  and  dispersion.  The  dissipation  is  present  due  to  even  order 
spatial  derivatives  in  the  truncation  error  which  are  a  result  of  the  upwind  approximation.  These  schemes  are 
consistent  approximations  with  0(At2,  Ax1)  truncation  error  for  the  second-order  algorithm.  A  complete  Fourier 
analysis  is  omitted  here  for  the  sake  of  brevity,  but  is  shown  in  [22]. 


4  Boundary  Conditions 

The  characteristic  based  LU/AF  method  requires  no  extraneous  boundary  condition  such  as  the  Liao  absorb¬ 
ing  boundary  condition  [23]  or  the  PML  [24].  The  FDTD  method  uses  a  spatial  central  differenceoperator,  which 
for  a  wave  propagating  from  left  to  right,  eventually  requires  a  grid  point  outside  the  domain.  This  requirement 
introduces  an  additional  equation  (i.e.  boundary  condition)  to  solve  the  system  and  introduces  information  into 
the  solution  that  is  not  required  by  Maxwell’s  equations.  Using  an  upwind  characteristic  based  approach,  the 
interior  point  algorithm  calculates  the  left-going  characteristic  at  the  left  boundary  (i.e.  at  i  =  0)  and  the  right- 
going  characteristic  at  the  right  boundary  (i.e.  at  i  =  imax).  Therefore,  the  only  additional  information  required 
is  information  about  waves  that  are  entering  the  domain.  Waves  exiting  the  domain  are  naturally  handled  by  the 
interior  point  algorithm.  Therefore,  the  characteristic  boundary  conditions  are  implemented  as  follows:  at  gnd 
point  i  =  0,  a  first-order  version  of  (13)  is  used  along  with  a  specification  of  the  incoming,  right-going  flux,  ft. 
At  grid  point  i  —  imax,  a  first-order  version  of  (12)  is  used  along  with  a  specification  of  the  incoming,  left-going 
flux,  ft  .  Therefore,  the  only  additional  information  introduced  at  the  boundary  is  nothing  more  than  what  is 
requiredby  the  physical  system.  In  multidimensional  problems,  the  local  coordinates  at  the  outer  boundanes 
are  rotated  to  align  with  the  direction  of  wave  propagation  defined  by  E  x  H.  The  characteristic  equations  are 
developed  along  this  direction  and  are  appropriately  applied  at  the  boundaries.  This  procedure  was  discussed 
and  outlined  by  Shang  [11]. 


5  Model  Problem  Results 


The  model  problem  results  presented  in  this  section  demonstrate  that  the  one-dimensional  implicit  characteristic- 
based  LU/AF  algorithm  can  produce  accurate  results  on  a  nonuniform  grid  for  time  steps  significantly  larger  than 
the  maximum  permissible  for  a  typical  conditionally  stable  scheme.  Although  the  operation  count  for  the  implicit 
scheme  is  higher,  as  the  mesh  variation  is  increased,  fewer  time  steps  are  needed  for  an  accurate  solution  over 
a  fixed  simulation  time,  and  this  savings  more  than  offsets  the  additional  operations  (beyond  a  certain  mesh 
variation). 


The  second-order  LU/AF  algorithm  was  tested  by  implementing  equations  (12)  and  (13)  for  interior  grid 
points  away  from  absorbing  boundaries  and  first-order  equations  at  grid  points  next  to  the  absorbing  bour>  ■ 
aries.  Characteristic-based  boundary  conditions  were  used  to  terminate  the  computational  domain  and  the 
incoming  flux  (ft  J  at  the  right  boundary  was  set  to  zero.  The  code  was  initialized  by  writing  a  time  snapshot  of 
a  propagating  pulse  in  the  grid.  Several  different  types  of  problems  were  analyzed  with  the  LU/AF  algorithm  and 
compared  with  the  FDTD  method  in  an  attempt  to  assess  the  characteristics  of  the  LU/AF  algorithm  as  applied 
to  a  one-dimensional  problem  with  a  non-uniform  grid.  These  are  outlined  in  the  following  sections. 


5.1  Propagation 

To  assess  the  dispersion  and  dissipation  characteristics  of  the  LU/AF  algorithm,  several  propagation  problems 
were  analyzed  using  free  space  and  dielectric  materials  and  were  compared  with  FDTD.  A  Gaussian  pulse  of 
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was  used  as  an  excitation  source  for  both  propagation  and  scattering  problems.  For  propagation,  the  code  was 
changed  to  use  periodic  boundary  conditions  to  enable  simulation  of  propagation  over  large  distances  compared 
to  the  shortest  wavelength  contained  in  the  frequency  spectrum  of  the  pulse.  To  illustrate  the  potential  benefit 
of  the  LU/AF  algorithm  over  conventional  explicit  schemes,  the  main  parameters  of  interest  are  the  time  reso¬ 
lution  (i.e.  number  of  time  steps/period)  and  the  grid  resolution  (i.e.  number  of  cells/wavelength)  of  the  highest 
frequency  of  interest  in  any  given  problem.  These  parameters  will  be  designated  as  Nt  and  Nx,  respectively.  The 
other  parameter  of  interest  is  the  mesh  stretch  ratio,  Ma.  The  highest  frequency  of  interest,  fmax,  is  calculated 
based  upon  Nx  and  the  largest  cell  size,  and  the  time  step  for  the  implicit  scheme  is  calculated  based  upon  fmax 
and  the  desired  time  resolution,  Nt.  For  the  FDTD  method,  the  time  step  was  calculated  based  upon  the  Courant 
stability  condition  using  the  smallest  grid  size. 


5.1.1  Uniform  Grid 


Figure  1 : 


The  first  propagation  problem  involves  propagation  in  free  space  on  a  uniform  mesh.  The  time  and  grid 
resolutions  are  Nt  =  30  and  Nx  =  30,  the  cell  size  was  1  cm,  the  time  step  was  33.3  ps  and  u  =  1.  The 
maximum  frequency  of  interest  was  1  GHz  and  the  problem  space  was  2000  cells.  The  pulse  was  allowed  to 
propagate  a  distance  equal  to  100  wavelengths  of  the  minimum  wavelength,  which  corresponds  to  a  distance 
of  about  30  meters.  Figure  1  shows  the  electric  field  versus  distance  obtained  after  3000  time  steps  computed 
using  the  exact  solution,  FDTD  and  the  LU/AF  algorithm. 

1  - |-Tr — I - 1 - 1 -  Note  the  LU/AF  scheme  has  slightly  less  accurate  results  due  to 

_  o  8  -  f\  Exact _ _  the  larger  dispersion  error;  but  the  general  shape  of  the  pulse  is 

£  I  1  FDTD -  still  acceptable. 

>  0.6  -  M  LU/AF  — •  _ 

0.4  -  /  \  -  5.1.2  Nonuniform  Grid 

%  0-2  "■  /  \  “  To  demonstrate  the  LU/AF  scheme  on  nonuniform  meshes,  the 

m  o  _ / . .V _  same  problem  outlined  above  was  simulated  using  mesh  stretch 

,  ,  ,  ,  ratios  of  M3- 1 0:1  and  M,= 1 00:1  and  the  mesh  was  periodic  every 

■°-2  ~ — Tj — ““ — n — ~~ — "L  1 0  cells.  Figure  2  illustrates  an  expanded  section  of  the  nonuni- 

x  (meters)  form  mesh  for  a  mesh  stretch  ratio  of  M,  =10:1.  The  smallest 

grid  cells  force  the  FDTD  algorithm  to  take  a  time  step  1/1 0th  of 
Figure  1:  the  previous  case,  and  it  must  run  10  times  as  many  time  steps 

to  propagate  the  pulse  the  same  distance.  However,  the  LU/AF 
algorithm  can  take  the  same  time  step  as  before,  because  the  time  resolution,  Nt  =  30,  has  been  preserved  and 
the  time  step  is  set  solely  upon  accuracy  requirements,  not  on  the  Courant  stability  condition. 

The  maximum  ceil  size  remains  constant,  and  since  Nx  is  also  the  ill!  I  |>|||||fl!|!!  llljAlll  1 1  |jl|H||||| 

same,  the  maximum  frequency  of  interest  remains  constant  at  1  GHz.  ||  i  |  B[||  |  [  |  H1IB1 

The  maximum  grid  coordinate  changes  from  20.0  in  the  uniform  mesh  0  005  Q1  015  02  025  03 

case  to  8.17  for  the  10:1  nonuniform  mesh.  The  Courant  numbers  in  ’  x  (meters) 

the  grid  for  FDTD  are  in  the  range  0.1  <  v  <  1  and  the  Courant  _. 

numbers  for  the  LU/AF  algorithm  are  in  the  range  1  <  v  <  10.  Figure  '9ure  * 

3  shows  the  electric  field  versus  distance  obtained  with  the  exact  solution  and  after  30,000  time  steps  computed 
using  FDTD  and  after  3,000  time  steps  using  the  LU/AF  algorithm. 

1  - mr — | - 1 - 1 -  Note  the  agreement  is  very  good  for  both  methods  in  this  case.  In 

___  o  8  -  A  Exact  _ _  fact,  Figure  4  shows  the  error  in  electric  field  versus  distance  for 

£  M  FDTD -  both  methods  and  the  errors  are  almost  equal  in  magnitude. 

-  0.6  -  M  LU/AF  -  jhe  LU/AF  method  required  about  52  seconds  and  504  x  106  op- 

■§04-/1  _  erations,  but  the  FDTD  method  required  about  66  seconds  and 

o  I  \  480  x  106  operations.  Note,  however,  the  improved  performance  of 

§  0.2  -  /  \  -  the  LU/AF  algorithm  away  from  the  area  of  the  main  pulse.  Since 

□  0  _ / . V, _  the  mesh  stretch  ratio  was  1 0  for  this  problem,  the  FDTD  method  is 

forced  by  the  Courant  stability  condition  to  take  1 0  times  the  num- 

*0-2  - 1 - 1 - 1 - 1 -  ber  of  time  steps  to  simulate  the  same  amount  of  physical  time. 
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Figure  2: 


Figure  3: 
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5.2  Lossy  Dielectric  Material 
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To  illustrate  propagation  in  dielectric  materials,  the  problem 
space  was  filled  with  a  lossy  dielectric  material  with  parameters 
e  =  4eo,  fi  =  no  and  a  —  0.02.  The  pulse  was  allowed  to  propa¬ 
gate  on  a  uniform  mesh  for  approximately  35  wavelengths  at  the 
minimum  wavelength.  Rgure  5  shows  the  results  of  FDTD  versus 
LU/AF  with  very  good  agreement  between  the  two  methods. 

A  similar  level  of  agreement  is  found  with  the  same  problem  on 
a  nonuniform  grid.  The  difference  in  these  results  is  most  likely 
attributed  to  the  nonzero  factorization  error  which  occurs  when  the 
conduction  current  term  is  included. 


Rgure  4: 

6  Conclusions 


This  paper  has  explored  the  benefits  of  using  implicit  finite- 
difference  methods  for  computational  electromagnetics.  An  im¬ 
plicit,  second-order  accurate,  LU/AF,  characteristic  based  algo¬ 
rithm  for  electromagnetics  has  been  implemented  and  tested 
on  one-dimensional  model  problems  for  uniform  and  nonuniform 
grids.  The  one-dimensional  model  problem  results  for  the  charac¬ 
teristic  based  implicit  scheme  demonstrate  that: 

1 .  Accurate  solutions  for  wave  propagation  can  be  obtained  us¬ 
ing  Courant  number  significantly  greater  than  one  on  nonuni¬ 
form  grids. 


0.008 
__  0.006 
^  0.004 

~  0.002 
s=  0 

•|  -0.002 

®  -0.004 

in 

-0.006 
-0.008 

10  15  20  25  30  35 

x  (meters) 


2.  Stability  of  the  implicit  scheme  allows  a  given  amount  of  time  Rgure  5: 

to  be  covered  in  fewer  time  steps  than  a  conditionally  sta¬ 
ble  scheme,  and  this  can  significantly  improve  efficiency  when  nonuniform  grids  are  needed.  This  is  also 
extremely  beneficial  for  low  frequency  excitation  sources  where  geometric  details  may  force  a  highly  over¬ 
sampled  mesh  (uniform  or  nonuniform)  in  certain  regions. 


This  approach  appears  promising  for  development  of  stable,  accurate  and  efficient  implicit  LU/AF  schemes  for 
complex  two  and  three  dimensional  applications. 
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In  the  early  days  of  radio,  a  large  percentage  of  antennas, 
amateur  and  commercial,  were  fed  at  the  end,  rather  than  at  the 
center  as  is  now  more  common.  The  major  factors  in  the  change¬ 
over  appear  to  be: 

-  The  move  to  higher  frequencies 

-  Development  of  flexible  solid- insulated  transmission  lines 

-  Development  of  the  Yagi-Uda  array,  with  simplified  feed. 

A  result  of  the  change-over  is  that  standard  references  have 
very  little  information  on  end  fed  antennas.  There  is  usually  a 
mention  of  the  type,  and  perhaps  some  examples  of  practical 
types,  but  no  real  analysis.  At  most  there  will  be  a  general 
discussion.  Kraus,  for  example,  has  no  title  or  index  entry. 
The  ARRL  Antenna  Book  has  several  entries  and  some  discussion 
of  general  principles  and  some  practical  examples,  but  no 
references  to  theory.  Schelkunoff  and  Friis  mention  that  such 
antennas  may  be  fed  against  the  capacitance  of  the  generator 
(to  the  rest  of  the  world)  ,  or  by  connection  to  one  side  of  an 
open  wire  transmission  line.  To  these  should  be  added  feed 
against  another  antenna  section,  or  against  a  ground  or 
counterpoise. 

This  paper  considers  some  aspects  of  end  feeds,  the  primary 
purpose  being  to  determine  if  the  available  practical  design 
information  needs  extension. 

END-POINT  DRIVE  IMPEDANCE 

If  first-order  antenna  theory  is  followed,  it  is  not  possible 
to  end  feed  an  antenna,  since  the  current  is  zero  and  drive 
impedance  is  infinite:  no  power  can  be  transferred.  However, 
the  theory  neglects  the  possibility  of  the  antenna  being 
coupled  to  another  conductor,  and  in  any  event  the  current  does 
not  go  to  zero  until  atomic  distances  are  reached. 

Current  moment  analysis  methods  do  not  provide  for  end  feeds. 
It  appears  that  the  common  practice  for  determining  the  drive 
impedance  at  the  antenna  end  is  to  determine  the  center 
impedance  of  an  antenna  of  twice  the  length,  and  use  half  of 
this  as  the  drive  impedance.  An  alternate  is  to  feed  the 
antenna  against  ground,  and  use  the  drive  impedance  directly. 
With  a  frequency  sweep,  these  give  a  visualization  of  end  feed 
drive  characteristics. 
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For  example.  Fig.  1  shows  a  computer  generated  plot  of  center 
impedance  of  a  0.5  meter  dipole  of  1  mm  diameter,  over  the 
range  of  150  to  1400  MH2,  as  calculated  by  NEC-2  (EzNEC 
version)  .  The  step  value  was  50  MHz,  sufficient  to  give  good 
resolution  for  the  series  resonance  points.  However,  the  rate 
of  change  of  reactance  is  very  large  around  the  parallel 
resonant  points,  as  shown  by  the  Vee-shapes  of  the  point-to- 
point  plot.  The  added  arc  is  a  better  estimate  of  these 
impedances .  See  Kraus  and  Schelkunof f  and  Friis  for  theoretical 
and  approximate  methods  of  deriving  these  plots.  This  figure 
indicates  that  the  drive  resistance  at  the  first  two  parallel 
resonant  points  is  600  and  1000  ohms,  very  closely. 

A  partially  independent  calculation  of  the  drive  impedance  is 
possible  by  noting  the  drive  impedance  at  the  first  segment  of 
the  moment  method,  and  moving  this  segment  closer  to  the 
antenna  end  by  increasing  the  number  of  segments.  The  resulting 
curve  should  approach  the  drive  impedance.  Such  a  plot  is  shown 
in  Fig.  2.  for  the  range  of  11  segments  to  251.  The  initial 
increase  in  drive  resistance  as  the  number  of  segments 
increases  appears  to  reflect  an  increase  in  calculation 
accuracy  (MiniNEC  shows  the  same  pattern) .  The  indicated 
increase  in  drive  point  reactance  was  not  expected.  It  appears 
to  arise  from  the  continued  decrease  in  the  capacity  of  the 
segment  between  the  drive  point  and  the  antenna  end  as  segment 
length  decreases:  the  indicated  reactance  is  the  sum  of  this 
plus  that  of  the  longer  part  of  the  antenna,  which  changes 
slowly  with  segment  size.  The  estimated  first  parallel  resonant 
end  drive  resistance  is  about  500  ohms  by  this  method. 

THE  ZEPP  ANTENNA 

The  classical  Zepp  antenna  is  a  half  wave  dipole  fed  at  the 
end  by  connection  to  one  side  of  an  open-wire  one-quarter  wave 
transmission  line.  Fig.  3  shows  a  view  of  the  antenna  in  free 
space,  with  the  calculated  segment  currents.  The  antenna 
current  is  very  nearly  sinusoidal,  with  a  small  current  at  the 
driven  end.  The  currents  on  the  line  wires  are  essentially 
mirror- images  except  for  the  10%  closest  to  the  antenna. 
Calculated  drive  impedance  is  37+j46  ohms  with  the  wire  size 
and  line  spacing  used,  1mm  wire  and  a  spacing  of  1cm.  Fig.  4 
shows  the  SWR  curve  for  50  ohm  feed  to  the  transmission  line 
short,  indicating  resonance  at  294  MHz.  The  imbalance  on  the 
transmission  line  indicates  that  it  is  radiating.  This  shows  in 
the  free- space  azimuth  pattern  plot  of  Fig.  5  {It  also  shows  in 
the  elevation  plot) . 

With  a  wide  range  "antenna  tuner",  the  Zepp  is  a  good  multi¬ 
wave  antenna.  For  example.  Fig.  6  shows  the  current  pattern  at 
the  second  harmonic,  where  the  feed  point  impedance  is 
1850+j 3050  ohms.  The  feedline-  antenna  interaction  pattern  is 
the  same .  The  azimuth  antenna  pattern  becomes  more  nearly  a 
circle,  the  result  of  the  four  lobes  of  the  antenna  plus  feed 
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radiation. 


ZEPP  VARIATIONS 

A  simple  variation  of  the  Zepp  is  to  eliminate  the  unconnected 
transmission  wire,  feeding  the  other  wire  against  ground,  the 
antenna  sometimes  called  the  Marconi.  The  resulting  currents  at 
first  resonance  are  shown  in  Pig.  7,  the  drive  impedance  being 
94+ jl  ohms.  For  these  dimensions  the  pattern  has  a  large 
vertical  lobe,  shown  in  Fig.  8.  If  the  height  is  selected  to 
give  a  half-wave  transmission  line,  the  currents  are  as  shown 
in  Fig.  9,  with  a  drive  impedance  against  ground  of  2235+ j 215 
ohms.  Patterns  are  shown  in  Fig.  10.  Another  variation  adds  a 
grounded  vertical  section  at  the  far  end  of  the  Marconi, 
sometimes  called  the  half-square  beam.  Drive  impedance  is  124- 
j50  ohms,  with  currents  as  shown  in  Fig.  11.  Patterns  are  shown 
in  Fig.  12. 

No  attempt  to  optimize  these  Zepp  variations  for  a  particular 
characteristic  is  attested  here.  While  this  can  be  done,  a 
wide-range  antenna  tuner  will  compensate  for  most 
characteristics  encountered . 

THE  J-POLE  ANTENNA 

There  is  a  family  of  line -fed  antennas,  with  the  Zepp  half¬ 
wave  radiator  at  right  angles  to  the  feeders,  and  the  ”J-Pole" 
with  it  continuing  the  line  of  one  feeder.  The  J-pole 
arrangement  and  calculated  current  distribution  is  shown  in 
Fig.  13.  Drive  impedance  at  the  center  of  the  bottom  short  is 
40+ j 8  ohms;  match  to  coax  is  customarily  secured  by  tapping  up 
on  the  open  wire  line  a  short  distance.  The  pattern  is  affected 
by  the  imbalanced  transmission  line  currents,  as  shown  in  Fig. 
14.  However,  the  distortion  is  rarely  noticed  in  use  since  it 
is  small . 

Some  of  the  practical  design  in  the  literature  depart  from  the 
1/4  and  1/2  wave  dimensions  by  a  small  amount,  apparently  to 
secure  better  SWR  bandwidth.  A  few  checks  indicate  that  some 
improvement  can  be  secured,  but  the  possibilities  have  not  been 
explored  in  detail . 

CONCLUSIONS 

While  this  study  has  shown  some  interesting  detail  about  the 
operation  and  performance  of  end- fed  antennas,  the  major 
conclusion  is  that  the  handbook  concepts  and  design  data  are 
adequate  for  their  intended  purpose.  There  is  indication  that 
some  optimization  for  a  particular  characteristic  is  possible, 
and  that  this  can  be  done  with  the  available  moment -method 
programs . 
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enter  reactance-resistance  plot  for  A  0.5  meter  dipole 
iameter,  from  150  to  1200  MHz,  step  50  MHz.  See  text. 
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Fig.  4.  SWR  for  the  Zepp  antenna  fed  with  50 
ohm  coax. 
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Fiq .  5.  Free  space  azimuth  plot  for  the  Zepp  antenna  at  300 
MHz. Both  antenna  radiation  and  line  leakage  are  present. 


Fig.  6.  Current  distribution  for  the  zepp 
antenna  at  600  MHz.  Note  the  similarity  of 
feed  line  current  to  that  of  Fig.  1. 


Fig.  7.  Current  distribution  on  an  Ell  or 
Marconi  antenna,  essentially  a  Zepp  with  one 
side  of  the  transmission  line  removed  and  the 
other  fed  against  ground.  Height  is  1/4  wave. 


Fig.  8.  Radiation  pattern  of  the  Ell  antenna.  The  antenna  lobe 
directed  upward,  with  the  single  wire  feed  acting  as  a  vertical 
radiator. 
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Fig.  10.  Azimuth  and  elevation  pattern  plots  for  antenna  of  Fig 
9- 
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Fig.  11.  Current  distribution  for  a  half-wave 
vertical  fed  with  a  collinear  1/4  wave  open 
transmission  line,  the  J-Pole  antenna. 
Compare  the  current  distribution  with  Fig.  1. 


Fig.  12.  Radiation  pattern  of  the  J-Pole  in  free  space. 
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Abstract-  In  this  paper  we  present  the  results  of  our  investigation  of  a  multi-element  antenna  design, 
which  was  inspired  by  Laufer  and  Ono  [1].  The  objective  of  this  effort  is  to  evaluate  the  suitability 
of  this  type  of  antenna  for  VHF  (30-88  MHz)  applications.  The  Numerical  Electromagnetics  Code 
(NEC4)  is  utilized  to  analyze  the  characteristics  of  the  antenna  system.  Three  different  excitation 
methods  were  studied  to  seek  improvement  in  overall  performance.  Some  representative  numerical 
results  are  presented  in  the  paper  and  conclusions  based  on  these  results  are  included. 

1.  Introduction 

Broadband  wire  antennas  have  received  a  great  deal  of  attention  in  recent  years  because  they 
have  found  widespread  applications  in  communication  systems.  Numerous  design  techniques  have 
been  proposed  during  the  last  few  years  [2,3]  to  improve  the  performance  of  wire  antennas  that  are 
inherently  narrowband  in  nature.  The  approach  described  in  this  paper  to  broadbanding  the  antenna 
is  to  employ  multiple  wire  elements  with  a  single  feed,  such  that  the  elements  resonate  at  different 
frequencies  to  cover  the  desired  frequency  range.  We  investigated  this  type  of  antenna  design  with  a 
view  to  evaluating  its  broadband  characteristics  and  suitability  for  a  mobile  communication  unit. 
Three  variants  of  the  multiresonant  antenna  system  have  been  studied  in  this  effort.  All  of  these  have 
an  identical  upper  portion  comprising  9  parallel  vertical  wires  of  varying  lengths.  Both  the  directive 
Gain  and  VSWR  characteristics  have  been  examined,  and  the  possibility  of  matching  the  antenna  to 
bring  the  VSWR  level  down  to  3.5: 1  or  better  has  been  investigated. 

2.  The  Antenna  Analysis  and  the  Results 

The  geometries  of  the  three  antennas  modeled  in  this  paper  are  shown  in  Fig.  1.  All  of  them 
have  an  identical  upper  section,  which  consists  of  a  set  of  nine  parallel  vertical  wires  (see  Fig.  1). 
The  lengths  of  the  wires  are  85cm,  92cm,  106cm,  120cm,  134cm,  148cm,  162cm,  176cm  and 
182cm,  respectively.  They  are  ranging  from  about  one-quarter  of  the  longest  wavelength  to  one- 
quarter  of  the  shortest  wavelength  in  the  30-88  MHz  frequency  band. 

As  mentioned  above,  to  look  for  an  appropriated  performance,  three  structures  with  different 
feeding  at  the  lower  part  of  the  antenna  (see  Figs,  la  though  lc)  were  investigated.  The  first 
structure  has  a  wire  loop  of  radius  r  =  5cm  (Fig.  la),  which  supports  the  9  vertical  wires  that  are 
placed  equidistantly  around  its  periphery  and  fed  through  one  wire.  The  lower  part  of  the  second 
configuration  studied  has  a  system  of  radial  spokes  r  =  5cm  that  are,  in  turn,  connected  to  the 
vertical  wires  as  shown  in  Fig.  lb.  Finally,  the  third  geometry  modeled  consists  of  a  system  of 
inclined  wire  segments  and  radials  that  form  a  truncated  cone-like  structure,  as  shown  in  Fig.  lc. 
The  radius  of  the  upper  and  lower  periphery  of  the  truncated  cone  are  R=5cm  and  r  =  3cm, 
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respectively.  All  of  these  three  antenna  structures  were  analyzed  via  the  NEC4  code.  Some  of  the 
performance  characteristics  of  the  antennas,  including  the  input  impedance,  input  power,  directive 
power  gain,  radiation  pattern  and  VSWR,  are  presented. 

Figures  2  and  3  depict  the  gain  and  VSWR  performances  of  the  three  antenna  structures.  It  is 
evident  from  these  figures  that  the  antennas  suffer  from  the  presence  of  multiple  resonances  in  the 
frequency  range  of  interest.  An  attempt  to  improve  the  VSWR  of  the  antenna  via  the  use  of  a 
matching  network  proved  futile  due  to  the  presence  of  a  large  number  of  extreme  peaks  and  valleys 
in  the  real  and  imaginary  parts  of  the  impedance.  It  was  determined  that  the  impedance  was  not 
matchable  without  paying  a  significant  penalty  in  terms  of  the  system  gain  performance.  Therefore 
we  conclude  that  the  antennas  shown  in  Fig.l,  based  on  our  analysis,  are  not  suitable  for  a  wideband 
communication  system. 

REFERENCE: 

1.  S.  Laufer  and  K.  Ono,  U.S.  Patent  4,302,760,  Nov.  1981. 

2.  M.  Bahr,  Boag,  E.  Michielssen  and  Raj  Mittra,  “Design  of  Electrically  Loaded  Wire  Antennas 

Using  Genetic  Algorithm,”  IEEE  APS  Trans.  Vol.  44,  No.5,  pp.  687-695,  1996. 

3.  P.  L.  Werner,  Z.  Atlman,  R.  Mittra,  D.  H.  Werner  and  A.  J.  Ferraro,  “Genetic  Algorithm 
Optimization  of  Stacked  Vertical  Dipoles  Above  A  Ground  Plane,”  Proceedings  of  IEEE  APS 
International  Symposium  1997,  pp.  1976-1979. 


u 


Fig.l.  Modeled  antenna  geometries. 
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Fig.2.  Gain  performances  of  three  antennas.  Solid  line  corresponds  to  the 
antenna  geometry  in  Fig.  la;  dashed  -  antenna  in  Fig.  lb;  short  dashed  - 
antenna  in  Fig.  lc.  Observation  point :  0=89.957°. 
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ABSTRACT 


Powerful  EM  codes,  originally  developed  for  the  computation  and  visualization  of  electromagnetic  fields  in 
charged-particle  accelerators,  are  currently  being  used  to  map  the  excitation  fields  of  Clustered  Phased  Arrays. 

The  2D  codes  currently  being  used,  were  originally  developed  for  large-scale  main-frames,  but  have  recently 
been  ported  to  high-end  PC-Workstations,  and  run  under  the  Windows  95  or  under  the  Windows  NT  4.0  operating 
systems.  Other  (much  larger)  3D  EM  accelerator  codes  are  currently  being  ported  from  the  original  Cray  or  Unix 
Workstation  environments. 

These  codes  have  been  found  to  be  ideally  suited  to  the  computation  and  graphic  visualization  of  the  internal 
excitation  fields  of  Clustered  Phased  Arrays.  These  excitation  fields  are  generated  by  the  linear  superposition  of 
multiple  traveling-waves  that  simultaneously  propagate  in  all  directions  through  a  two-dimensional  slow-wave 
guiding  structure  that  extends  in  a  plane  located  behind  and  parallel  to  the  whole  array  aperture,  thereby  feeding  all 
the  array  elements  with  signals  of  proper  amplitudes  and  phases. 

1  -  FUNDAMENTAL  OBJECTIVES. 

The  new  Clustered  Phased  Array  concept  was  conceived  and  is  being  developed  for  applications  to  high- 
directivity,  electronically-steered  microwave  antenna  systems,  as  a  remarkably  affordable  alternative  to  the 
notoriously  very  expensive  active  apertures. 

Clearly,  very  substantial  reductions  in  array  complexity  and  cost  can  be  attained  by  using  a  feed-network 
configuration  that  requires  a  much  smaller  number  of  beam-steering  controls,  in  particular  much  smaller  than  the 
very  large  number  of  radiating  array  elements  [1-8). 

Such  complexity  and  cost  reductions  are  actually  possible  because  of  the  intrinsically  very  large  redundancy  of 
beam-steering  degrees  of  freedom  exhibited  by  active  apertures.  Indeed,  in  active  aperture  arrays,  the  use  of 
numerous  element-level  active  modules  obviously  provides  the  quite  superfluous  capability  of  synthesizing  large 
numbers  of  totally  useless  aperture  distributions,  that  do  not  generate  any  practical  radiation  pattern. 

Remarkably  smaller  numbers  of  beam-steering  controls  become  sufficient  provided  that : 

a)  the  array  feed-network  used  achieves  a  pervasive  synergism  of  all  beam-steering  controls,  where  each  control 
affects  all  radiating  elements  and  each  element  is  affected  by  all  controls 

b)  only  the  required  aperture  distributions  are  generated,  and  . . . 

c)  the  elementary  aperture  distributions,  generated  by  all  the  beam-steering  controls  used,  are  linearly 
independent,  or  better  yet  mutually  orthogonal.  Ideally,  these  elementary  aperture  distributions  should  form  a 
complete  set  of  orthogonal  2D-Eigenfunctions,  spanning  the  vector-space  of  all  the  distributions  required  by  the 
specified  beam  steering  coverage. 
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A  continuing  theoretical  and  experimental  research  effort  is  currently  being  aimed  at  determining  the 
applicability  of  two-dimensional  wave-guiding  structures,  in  the  form  of  clusters  of  strongly  coupled  multiport 
cavity  resonators,  as  the  excitation-field  distribution-circuits  of  steered  phased-array  feed-networks. 

The  current  use  of  some  particle-accelerator  EM  codes,  for  the  computation  and  visualization  of  the  internal 
fields  of  the  cavity-cluster,  is  here  intended  to  complement  the  results  of  previous  extensive  network-theoretical 
analyses  [6-8],  and  experimental  measurements  [5],  by  providing  a  direct  computer-simulated  characterization  of 
both  single  multiport  cavity  resonators  as  well  as  of  a  number  of  increasingly  larger  clusters  of  unit-cell  resonators 
(See  figures  1-3). 

2  -  CAPABILITIES  OF  THE  2D  LOS  ALAMOS  LABORATORY  ACCELERATOR  CODES. 

The  set  of  2D  EM  field-analysis  codes,  known  as  the  POISSON/SUPERFISH  group  of  codes,  has  been  developed 
and  progressively  evolved  by  the  Los  Alamos  Laboratory  Accelerator  Codes  Group  ( LAACG ),  and  is  primarily 
intended  for  application  to  particle  accelerator  design  [9-10]. 

Various  components  of  the  set  are  nevertheless  sufficiently  general  to  be  applicable  in  the  analysis  of 
communication  antennas  and  phased  arrays. 

All  the  numerous  FORTRAN  77  programs  in  the  set  have  already  been  ported,  from  their  original  UNIX  and/or 
CRA  Y environment,  to  high-end  PC-Work  Stations  running  Windows  95  or  Windows  NT. 

Most  remarkably,  the  POISSON/SUPERFISH  group  of  codes  is  available  free  of  charge ,  upon  request,  from  the 
Los  Alamos  Laboratory  Accelerator  Codes  Group  (LAACG). 

The  group  continuously  improves  the  code  functionality,  and  recurrently  updates  the  related,  very  detailed 
documentation  |9].  The  following  is  a  very  short  description  of  only  those  electromagnetic  field-analysis  capabilities, 
that  are  conveniently  applicable  to  communication  antenna  and  phased  array  problems. 

2.1  -  The  Mesh  Generator. 

The  AUTOMESH  program,  is  the  most  fundamental  component  of  the  POISSON/SUPERFISH  group  of  codes, 
and  has  advanced  capabilities  for  generating  and  optimizing  discrete  triangular  meshes  with  selectively  controllable 
resolution  across  any  given  two-dimensional  EM  field-domain. 

The  RF  structure  to  be  analyzed  and  the  EM  field-domain  can  be  geometrically  outlined  in  either  a  2D  Planar- 
Cartesian  or  a  circularly-symmetric  Cylindrical  reference  frame  (in  which  case  an  implied  circular  symmetry  of 
both  the  structure  and  the  EM  fields  is  assumed). 

The  maximum  total  number  of  internal  mesh-nodes  is  only  limited  by  the  size  of  RAM  and  Virtual  Memory 
available  in  the  system  used.  Further,  different  regions  of  the  total  field-domain  can  be  geometrically  outlined,  and 
selectively  meshed  with  the  most  appropriate  resolutions  for  a  faithful  display  of  the  expected  field  distributions. 

The  generated  mesh  is  automatically  optimized,  by  a  process  of  successive  over-relaxations  that  repetitively 
reshapes  the  mesh,  aiming  at  one  composed  of  quasi-equilateral  triangles,  and  at  the  same  time  prevents  the 
generation  of  triangles  with  large  aspect  ratios. 

Also  the  input  data,  defining  the  geometry  of  the  RF  structure  to  be  analyzed,  are  automatically  checked  for 
physical  consistency. 

The  final  configuration  of  the  mesh  is  saved  in  a  binary  file,  for  subsequent  graphic  display  and  further  use  in  the 
electromagnetic  field  solver  programs  FISH  and  CFISH. 

2.2  -  The  Electromagnetic  Field  Solvers. 

The  mesh  generated  by  the  AUTOMESH  program  can  be  visualized  by  means  of  the  2D  graphic  program 
VGAPLOT,  and  is  further  used  in  either  of  the  two  RF  field  solver  programs  FISH  and  CFISH.  These  RF  field 
solvers  compute  the  electromagnetic  field  values  at  the  given  mesh  nodes,  by  numerically  solving  Maxwell's 
equations. 
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While  the  FISH  code  only  computes  the  local  field  amplitudes ,  and  is  mainly  used  for  searching  for  and 
identifying  resonant  structure  modes,  the  CFISH  code  actually  computes  the  local  real  and  imaginary  components  of 
the  fields,  and  can  therefore  map  traveling-wave  propagation  processes  in  non-resonant  systems. 

The  RF  field  solver  FISH  accepts,  as  inputs,  Cartesian  X  and  Y  coordinates  specifying  the  location  of  a  single 
point-source.  The  RF  field  solver  CFISH,  however,  can  even  accept  the  locations,  amplitude-  and  phase-settings  of 
multiple  RF  sources  ( n  <  5),  that  can  either  be  point-sources  or  line-sources. 

In  particular,  the  CFISH  program  can  analyze  the  complex  EM  fields  generated,  at  any  specified  fixed 
frequency,  by  multiple  RF  line-sources  (/i  <  5),  each  line-source  having  different  amplitude-  and  phase-settings.  The 
CFISH  program  provides  thus  the  capability  of  simulating  the  simultaneous  propagation  of  multiple  sets  of  traveling 
waves,  in  all  possible  directions,  through  two-dimensional  wave-guiding  structures. 

Further,  using  the  CFISH  program,  different  regions  of  the  field-domain  can  be  geometrically  outlined  and 
electrically  specified  by  selectively  assigning  the  local  values  of  the  real  and  imaginary  parts  of  the  permittivity  s’ 
and  e",  and  those  of  the  permeability  (J.’  and  fl",  thus  simulating  complex  media  and  absorbing  boundaries. 

Either  Dirichlet  or  Neuman  boundary  conditions  can  be  assigned  to  selected  external  or  internal  field-interfaces, 
in  both  the  FISH  and  CFISH  EM  field  solver  programs. 

The  RF  field  solver  programs  FISH  and  CFISH  append  the  array  of  computed  field  values,  in  binary  form,  to 
the  AUTOMESH  output  file.  That  binary  file  is  used  as  both  a  repository  of  the  output  data  sets  generated  by  the 
various  programs,  and  as  a  communication  link  between  the  different  programs  being  run  in  any  given  simulation. 

2.3  -  Using  the  E  and  H  Fields  in  Complementary  Roles. 

In  the  analysis  of  some  EM  problems  in  Cartesian  geometry,  where  the  X-Y  reference  plane  is  coincident  with 
the  tf-plane  of  the  field  (with  an  implied  Hz  -  0  component),  the  physical  roles  of  the  electric  and  magnetic  fields  E 
and  H  may  be  mutually  exchanged,  by  actually  solving  the  dual  EM  problem. 

The  dual  EM  problem  is  then  defined  by  switching  the  type  of  boundary  condition  used  along  the  externa! 
boundaries  of  the  field-domain,  from  either  Dirichlet  to  Neuman  or  from  Neuman  to  Dirichlet. 

This  expedient  circumvents  the  intrinsic  inability  of  both  the  FISH  and  CFISH  EM  field  solver  programs  to 
handle  problems,  set  in  a  Cartesian  reference  frame,  that  are  a-priori  known  to  lead  to  a  field  solution  having  zero 
Hz  component.  A  typical,  fundamental  example  is  given  by  the  propagation  of  TE- modes  in  a  rectangular 
waveguide  that  has  its  broadside  walls  parallel  to  the  X-Y  reference  plane.  The  dual  EM  problem  is  then  defined  by 
imposing  a  Neuman,  rather  than  a  Dirichlet,  boundary  condition  along  the  traces  of  the  waveguide  short-walls  on 
the  X-Y  reference  plane,  thus  implying  these  to  behave  as  magnetic  rather  than  electric  walls. 

2.4  -  Field  Visualization. 

The  2D  graphic  program  VGAPLOT  can  be  used  to  visualize  either  the  triangular  mesh  generated  by  the 
AUTOMESH  program,  or  to  generate  contour  plots  of  the  fields.  The  contour  lines  generated  from  the  resonant¬ 
mode  analyses  performed  by  the  FISH  code  represent  constant  field  amplitudes,  while  two-color  contour  plots  of  the 
real  and  imaginary  field  components  are  generated  from  the  complex-field  analyses  performed  by  the  CFISH  code. 

In  a  Cartesian  reference  frame,  the  VGAPLOT  graphic  program  displays  contours  of  either  the  HZ,EX  ,  or  the 
Ey  component,  while  in  a  circularly-symmetric  Cylindrical  reference  frame  the  program  displays  contours  of  either 
the EZ,H^  , or  the HR component. 

The  values  of  all  the  computed  field  components  can  be  interpolated  along  arbitrary  lines  cutting  through  any 
given  region  of  the  EM-field  domain. 

2.5  -  Post-processing  Capabilities. 

The  graphic  display  program  VGAPLOT  interpolates  the  computed  mesh-node  field-values  in  the  selected  two- 
dimensional  reference  frame,  to  generate  smoother  contour-lines  of  the  real-field  amplitudes  computed  by  the 


74 


program  FISH,  or  of  the  real  and  imaginary  parts  of  the  complex  fields  computed  by  the  program  CFISH.  The 
contour-lines  of  the  real  and  imaginary  parts  are  displayed  in  different  colors. 

Further,  the  post-processing  program  SF7  can  be  used  to  interpolate  the  computed  mesh-node  field-values  along 
straight  segments,  arcs  of  circle,  user-defined  curves,  and  even  on  arbitrarily-sized  rectangular  regions,  with  a  user- 
selected  resolution  along  the  lines,  or  along  the  X-  and  F-dimensions  of  the  rectangular  regions.  The  interpolator 
routine  uses  the  field  values  computed  on  the  network  nodes,  including  the  first-  and  second-nearest  nodes  to  the 
given  interpolation  path.  Attempts  at  interpolating  with  high-resolution  on  a  coarse  mesh,  however,  will  obviously 
generate  jagged  results. 

Most  remarkably,  the  post-processing  program  SFO  can  be  used  to  integrate  the  local  field  gradient,  along  the 
same  types  of  lines  used  for  SF7  interpolation.  The  performed  line-integration  computes  thus  the  corresponding 
total  potential  change  between  the  line  end-points.  This  capability  makes  the  computation  of  equivalent  total  voltages 
and  currents  possible,  in  particular  along  the  interface  lines  that  define  the  physical  locations  of  the  ports  of  a  multi- 
port  device.  These  interface  lines  should  be  carefully  selected  to  coincide  with  appropriate  phase  contour-lines,  that 
can  be  identified  by  generating  phase-contour  displays.  None  of  the  POISSON/SUPERFISH  £Af-solvers  and  post¬ 
processors  has  the  capability  of  generating  such  phase-contours,  but  these  can  be  easily  obtained  by  exporting  the 
solution  data  as  described  here  below.  Specially-defined  transit-time-integrals  can  also  be  computed,  for  the  complex 
fields  generated  by  the  program  CFISH,  thus  taking  into  account  possible  phase-differences  between  the  various 
points  along  the  integration  path.  The  transit-time-integrals  were  included  in  the  SFO  post-processor  to  compute  the 
amounts  of  energy  gained  by  charged  particles  being  accelerated  by  the  field,  and  for  this  purpose  apply  a  given 
relativistic  velocity  factor  J3  =  V  /  C  <  1.  For  purely  electromagnetic  computations,  however,  aiming  at  obtaining  the 
values  of  equivalent  impedances,  and  of  the  active  or  reactive  power  exchanged  though  a  given  interface  line,  as  well 
as  for  the  computation  of  scattering  parameters,  the  value  of  3  should  be  set  equal  to  the  relative  phase-velocity  of  the 
propagating  wave  v9,  -  vp  /  c . 

The  results  of  the  field  interpolations,  generated  by  the  post-processing  program  SF7  on  rectangular  regions,  are 
written  to  an  output  ASCII  file  that  may  include  ail  the  interpolated  values  of  the  amplitudes,  the  real  and  the 
imaginary  parts  of  the  Cartesian  components  of  the  E  and  H  fields,  and  those  of  the  corresponding  total  fields.  The 
tabulations  sections  of  such  ASCII  files  have  lines  of  constant  record-length,  that  can  be  extracted  with  any  ASCII-file 
editor,  saved  as  flat  ASCII  files,  and  loaded  into  MATLAB.  The  imported  field-values  can  then  be  used  as  initial  data 
in  very  extensive,  and  more  specialized  post-processing  computations.  Further  the  obtained  results  can  be  easily 
displayed  in  various  2D-  and  5Z)-formats,  by  using  the  powerful  built-in  graphic  capability  of  the  MATLAB 
program.  Most  remarkably,  3Z>-di$plays  and  contour-line  plots  of  the  field  phases  can  be  used  to  visualize,  and 
quantitatively  identify  the  shapes  and  locations  of  the  phase-fronts  of  relevant  propagating  modes.  Ultimately, 
advanced  post-processing  may  enable  the  computation  of  the  complex  coefficients  of  modal-expansions, 
appropriately  selected  to  match  the  fundamental  symmetries  of  the  fields  and  of  the  outer  boundaries. 

The  post-processing  program  SF7  can  also  be  used  to  compute  the  total  absorbed  active  power  in  any  given 
rectangular  region,  including  conductor  losses  if  any  and  the  power  absorbed  in  regions  filled  with  materials  having 
complex  permittivity  and/or  permeability.  Most  remarkably,  the  active  power  dissipated  in  simulated  absorbing 
boundaries  can  be  computed  by  confining  the  complex  media,  used  in  such  simulation,  to  regions  delimited  by 
lossless  outer  walls.  This  capability  provides  yet  another  way  of  computing  the  amounts  of  active  power  propagated, 
through  multi-port  microwave  devices,  by  traveling  guided  waves. 

2.6  -  Required  Processing  environment. 

The  already  ported  Fortran  programs  of  the  POISSON/SUPERFISH  group  of  codes  can  not  be  strictly 
considered  Windows  95  or  Windows  NT  programs,  but  can  nevertheless  be  run  under  control  of  either  Windows  95 
or  Windows  NT.  The  preferred  environment  is  a  DOS  Window  of  either  operating  system,  where  the  benefits  of 
enhanced  DOS  support  is  provided  by  either  of  the  two  Windows  Operating  Systems  to  all  the  programs  that  do  not 
directly  access  the  system  hardware.  New  versions  of  these  programs,  that  run  under  Windows  95  with  GUI  control 
are  being  developed  by  the  LAACG ,  and  are  actually  already  available  as  Beta  releases. 
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While  in  the  DOS  Window,  all  these  programs  run  in  character-mode,  and  are  basically  command-line  controlled, 
accepting  (in  an  admittedly  rather  old-fashioned  way)  typed  commands  from  the  keyboard  (only  occasionally)  or 
(mostly)  from  control  files.  The  control  files  completely  automate  the  numerical  solution  of  any  given 
electromagnetic  problem,  by  executing  the  required  programs  in  their  logical  order,  and  in  the  most  appropriate 
mode  for  the  given  problem.  The  fundamental  control  file  is  the  SVPERFISH  initialization  file  SF.INI,  that  globally 
sets  almost  all  the  control  parameters  for  the  various  programs  of  any  given  run.  Additional  control  files,  however, 
are  also  used  to  fine-tune,  modify  and/or  override  the  global  default  settings  for  any  specific  program  in  a  given  run. 
The  synergism  of  the  fundamental  initialization  file  SF.INI,  and  of  the  additional  control  files  indeed  provides  great 
flexibility,  almost  comparable  to  that  of  modern  point-and-click  menus,  but  this  flexibility  can  however  only  be 
exploited  by  manually  editing  the  control  files  beforehand,  and  by  carefully  understanding  the  mutual  interactions 
between  the  various  controls. 

The  graphic  programs  of  the  set,  that  directly  access  the  video  display  of  the  system,  are  the  only  ones  that  must 
be  run  in  real  DOS-MODE,  with  no  GUI  support. 

Essentially  all  the  Fortran  programs  of  the  POISSON/SUPERFISH  group  of  codes  use  as  much  of  the  available 
system  Extended  Memory  and  Virtual  Memory  as  needed,  with  DOS  Protected  Mode  Interface  (DPMI)  support. 
This  capabilities  make  possible  the  numerical  simulation  of  very  large  EM  problems,  where  the  total  extent  of  the 
mesh  can  even  include  hundreds  of  thousands  of  mesh-nodes. 

The  auxiliary  graphic  display  programs  that  must  run  in  real  DOS-Mode,  use  a  VCP1  DOS-Extender 
(PharLap.386)  to  access  the  available  Extended  Memory  of  the  computer  system. 

The  intermediate  results,  generated  by  the  various  programs  in  an  automated  control-file-driven  run,  are  passed 
to  the  next  program  in  line  by  progressively  appending  them  to  a  common  binary  solution  file.  The  binary  file  is  used 
both  as  the  output  file  of  a  program  and  as  the  input  file  for  the  next  program  in  the  given  running-order. 

The  installation  procedure  provided  with  the  Fortran  programs  in  the  POISSON/SUPERFISH  group  of  codes 
only  extracts  executable  files  from  a  compressed  archive,  and  stores  them  in  a  newly-created  folder,  without 
affecting  any  of  the  Windows  System  files.  The  PATH  environmental  variable  must  therefore  be  manually  modified, 
by  editing  the  AUTOEXEC.BAT  file,  if  one  such  file  exists.  Alternatively,  the  Property  Settings  of  the  individual 
programs  may  be  appropriately  edited,  or  ad-hoc  path-statements  may  be  inserted  as  a  first  fine  in  the  *.bat  file  used 
to  run  any  given  simulation. 

The  Property  Settings  of  each  individual  program  are  accessed  by  highlighting  the  file  icon  in  the  Windows 
Explorer,  by  then  right-clicking,  and  by  selecting  the  Properties  entry  of  the  right-click  pop-up  menu. 

All  the  executable  program  files  must  be  kept  in  the  same  folder  (normally  the  newly-created  default  folder), 
because  of  undocumented  functional  interdependence  between  programs. 

3  -  THE  SINGLE-CAVITY  RESONATOR  MODEL. 

Figure  1  shows  the  geometry  of  a  single,  six-port  cylindrical  cavity  resonator,  intended  to  be  the  unit-cell  of  a 
large  hexagonal  cluster  of  mutually-identical,  directly-coupled  resonators.  The  model  is  here  represented  in  a 
Cartesian  reference  frame,  where  the  A/Y-plane  is  orthogonal  to  the  cavity  axis  and  is  also  coincident  with  the  H- 
plane  of  six  rectangular  waveguides.  The  waveguides  extend  radially  at  60°  in  azimuth  from  each  other  around  the 
cavity  axis.  The  shape  of  the  outer  boundaries,  and  the  internal  electromagnetic  field  pattern,  generated  by  any 
given  excitation,  must  be  assumed  uniform  in  the  direction  of  the  Z-axis.  This  assumption  is  obviously  required  by 
the  restriction  of  the  presently  available  POISSON/SUPERFISH  £M-analysis  programs  to  two  dimensional  EM-field 
simulations.  The  Z-axis  is  coincident  with  the  cavity  axis.  The  cavity  is  assumed  to  be  air-filled,  while  the  six 
rectangular  waveguides  are  filled  with  a  lossless  dielectric.  The  permittivity  value  of  the  waveguide  dielectric  is 
sufficiently  high  to  make  the  TE,0  cut-off  frequency  of  the  waveguides  sufficiently  lower  than  the  TM0l0  resonant 
frequency  F0  of  the  cavity  resonator,  so  that  the  TEI0  waveguide  mode  is  propagating  (above  cut-off)  at  the  resonant 
frequency  F0  of  the  cavity,  while  its  /f-plane  is  orthogonal  to  the  cavity  axis,  and  coincident  with  the  i/-plane  of  the 
TM010  resonant  cavity  mode.  Because  of  the  assumed  uniformity  of  the  fields  in  the  direction  of  the  Z-axis,  the 
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simulated  cavity  mode  actually  is  a  TM0l  mode,  at  cut-off  with  respect  to  propagation  along  the  cavity  axis.  The  six 
waveguides  are  closed  on  wedge-shaped  matched  loads,  simulated  by  triangular  regions  filled  with  high-loss 
dielectric  material  having  a  wave-impedance  equal  to  that  of  the  TE]0  fundamental  waveguide  mode.  A  single  line- 
source  is  located  in  the  arbitrarily-selected  waveguide  No.  1 ,  at  sufficient  distance  from  both  the  No.  1  waveguide¬ 
load,  and  from  the  arc  of  circle  that  represent  the  input  interface  of  the  resonator.  The  simulation  model  shown  in 
Figure  1  represents  only  an  equatorial  cross-section  of  the  actual  shape  selected  for  the  unit-cell  of  the  phased-array 
cavity-cluster.  The  full  three  dimensional  geometry  of  the  actual  unit-cell  is  shown  in  Figure  3.  The  six  ports  of  the 
resonator  are  there  six  H-shaped  irises,  as  dictated  by  the  two  simultaneous  requirements  that:  a)  the  irises  be 
resonant  on  the  TM0I0  resonant  frequency  F0  of  the  resonator,  and  b)  that  the  azimuth  span  of  the  coupling  irises  be 
sufficiently  smaller  than  60°  ,  so  that  six  irises  may  fit  around  the  mid-section  of  the  resonator,  with  adjacent  irises 
having  negligible  direct  mutual  coupling. 

In  the  Cartesian  reference  frame  used,  the  complex-field  electromagnetic  solver  CFISH  always  computes  on  the 
mesh  nodes  the  values  of  the  Hz  field  component  oriented  along  theZ-axis.  In  the  given  physical  situation,  however, 
the  Hz  field  component  is  identically  zero,  so  that  here  the  mentioned  expedient  of  exchanging  the  roles  of  the  E  and 
H  fields  must  be  used  to  obtain  a  practical  solution.  As  already  described,  the  expedient  of  using  the  E  and  H  fields 
in  mutually  complementary  roles  consists  actually  in  posing  and  solving  the  dual  of  the  actual  physical  problem. 
This  means  that  the  propagating  waveguide  mode  must  be  assumed  to  be  the  TM10  mode,  that  has  a  non-zero  Hz 
field  component  and  its  .E-plane  coincident  with  the  A7F-plane  of  the  simulation.  In  this  dual  representation  of  the 
actual  physical  situation,  contour-lines  of  the  Hz  component  are  shaped  as  those  of  the  Ez  component  in  the  actual 
physical  model.  The  dual  representation  is  simply  obtained  by  replacing  the  external  electric-walls  of  the  cavity 
resonator,  and  of  the  waveguides,  with  fictitious  magnetic-walls.  This  means  setting  the  boundary  condition  along 
the  wall  of  the  cavity  resonator  and  along  those  of  the  waveguides  as  Dirichlet  (representing  a  magnetic  wall)  rather 
than  Neumann  (representing  an  electric  wall).  The  obtained  field  solutions  must  obviously  be  re-interpreted  in  terms 
of  the  physical  problem,  after  performing  the  computations.  This  means  converting  from  Ampere/meter  values,  of 
the  computed Hz  component,  to  Volt/meter  values  of  the  physical  Ez  component. 

Simulated  excitation  of  the  single-cavity  model  of  Figure  1  is  easily  obtained  by  defining  a  magnetic-field  line- 
source,  of  uniform  intensity  and  phase,  located  across  the  No.  1  waveguide  at  a  sufficient  electrical  distance  from  the 
matched  load  on  one  side  and  from  the  cavity  resonator  port  on  the  other.  Both  the  amplitude  in  Ampere/meter  and 
the  phase  in  degrees  can  be  specified,  with  the  source-phase  used  as  a  common  reference  for  the  position-dependent 
phases  of  the  electromagnetic  field  everywhere  else  within  the  cavity  and  its  waveguides.  Further,  the  post¬ 
processing  program  SFO  can  be  used  to  normalize  the  local  values  of  the  fields,  everywhere  within  the  cavity  and  the 
waveguides,  to  any  reference  value  arbitrarily  assigned  to  the  line-integral  of  the  field,  computed  along  the  X/Y- 
plane  trace  of  the  port  No.  1  interface.  The  normalization  applies  then  a  complex  scale  factor  to  the  local  field  values, 
that  makes  them  consistent  with  the  selection  of  port  1  as  the  zero-phase  reference-location. 

The  active  power  absorbed  by  each  of  the  six  simulated  matched  loads  (or  perfectly  absorbing  layers:  PML)  can  be 
computed  by  geometrically  defining  rectangular  interpolation  region,  that  fully  cover  each  of  the  waveguide  end- 
sections.  The  post-processing  program  SFO  can  then  be  used  to  integrate  the  active  power  absorbed  within  each 
rectangular  region,  which  equals  the  total  power  absorbed  by  each  of  the  simulated  matched  loads  if  the  waveguide 
walls  are  defaulted  to  being  lossless.  Further,  the  input  active  power  at  port  No.  1  can  also  be  computed  as  the  real 
part  of  the  complex  product  of  the  E-  and  /f-field  line-integrals,  along  the  arc  of  circle  that  represents  the  A7F-plane 
trace  of  the  port  No.  1  interface.  The  five  ratios  of  the  active  powers  absorbed  by  each  of  the  simulated  matched- 
loads  No.  2  through  No.  6,  to  the  input  active  power  at  port  No.  1  must  all  be  mutually  equal,  because  of  representing 
the  square  of  the  magnitude  of  the  common  value  of  all  the  cross-term  S y  =  S}i  of  the  resonator  scattering  matrix. 
Indeed,  as  a  consequence  of  the  C6V  rotation-a nd-refiection  symmetries  of  the  six-port  unit-cell  resonator,  its  6  x  6 
scattering  matrix  is  a-priori  known  to  be  simultaneously  symmetric  about  its  main  diagonal,  and  circulant,  so  that:  a) 
the  six  elements  of  its  first  row  are  the  only  different  element  of  the  whole  matrix,  and  b)  the  elements  symmetrically 
located  about  the  main  diagonal  are  mutually  equal.  The  combination  of  the  geometric  and  reciprocity  symmetries 
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reduces  the  total  number  of  different  scattering  matrix  elements  from  62  =  36  to  only  two,  the  two  being  and  Sy  . 
The  a-priori  knowledge  implied  by  this  symmetries  provides  a  rigorous  theoretical  basis  for  enhancing  the  accuracy 
of  the  results  of  numerical  field  simulations  by  way  of  least-squares-fitting. 

4  -  THE  HEXAFOLIUM  :  A  FIRST  18-PORT  RESONATOR  CLUSTER. 

Figure  2  shows  the  geometry  of  the  simplest  model  of  cavity-resonator  cluster,  that  includes  only  seven  mutually- 
identical  six-port  cylindrical  cavity  resonators.  The  seven  resonators  are  the  unit-cells  of  the  smallest  coupled- 
resonator  cluster  that  can  be  considered  as  a  two-dimensional  wave-propagation  medium.  Two  concentric  tiers  of 
unit-cells  can  be  identified,  the  first  consisting  of  the  single  central  resonator,  and  the  second  composed  of  the  outer 
ring  of  six  resonators.  The  seven-resonator  cluster  of  Figure  2  exhibits  the  same  hexagonal  C6V  rotation- and- 
reflection  symmetries  of  each  of  its  six-port  unit-cells,  and  represent  the  simplest  prototype  of  an  hexagonal  cluster. 
In  the  illustration  of  Figure  2,  the  model  is  represented  in  a  Cartesian  reference  frame,  where  the  X1Y- plane  is 
orthogonal  to  the  axes  of  the  seven  cavities  and  is  also  coincident  with  the  //-plane  of  the  cavity  TM0I  mode,  of  the 
TE,0  mode  of  the  18  external  waveguides  and  that  of  the  six  resonant  irises  of  the  central  cavity.  The  simulated  18 
radial  waveguides  and  the  six  irises  are  filled  with  lossless  dielectric,  as  in  the  single-resonator  model  of  Figure  1.  As 
a  consequence  of  the  C6V  rotation-nnA-reflection  symmetries  of  the  cluster  its  18  x  18  scattering  matrix  is 
simultaneously  symmetric  about  its  main  diagonal,  and  block-circulant  with  circulant  blocks,  so  that:  a)  the  18 
elements  of  its  first  row  are  the  only  different  element  of  the  whole  matrix,  b)  the  5x5  matrix  blocks  that  are 
symmetrically  located  about  the  main  diagonal  are  the  transpose  of  each  other,  and  c)  the  four  different  5x5 
matrix  blocks  are  all  circulant.  The  combination  of  the  geometric  and  reciprocity  symmetries  reduces  the  total 
number  of  different  scattering  matrix  elements  from  182  =  324  to  only  Jive,  the  five  being  S„  ,  Sl2  ,  S,4  ,  S]7  ,  and 
SIiJ0  16].  Again,  the  a-priori  knowledge  implied  by  this  symmetries  provides  a  rigorous  theoretical  basis  for  vastly 
enhancing  the  accuracy  of  the  results  in  numerical  field  simulations  by  way  of  least-squares-fitting. 

Equivalent  voltages  and  currents  at  the  18  external  ports  can  be  obtained  from  the  post-processing  program 
SFO,  by  computing  the  line-integrals  of  the  local  field  gradients  along  the  18  arcs  of  circle  that  represent  the  X/Y- 
plane  traces  of  the  input-port  interfaces  to  the  external  waveguides.  Similarly,  the  equivalent  voltages  and  currents 
at  the  six  internal  ports  of  the  central  resonator  can  be  obtained  by  computing  the  line-integrals  of  the  local  field 
gradients  along  six  straight  segments,  drawn  halfway  through  the  finite  thickness  of  the  irises  of  the  central 
resonator.  The  segments  represent  the  ATF-plane  traces  of  the  interfaces  between  the  inner  and  outer  tier  of 
resonators. 

The  post-processing  program  SFO  can  also  be  used  to  compute  the  total  active  power  dissipated  on  the  walls  of 
the  seven  cavity  resonator,  by  assuming  the  walls  to  have  a  finite  value  of  surface  resistivity.  Particular  attention 
must,  however,  be  paid  to  the  fact  that,  when  solving  complex-field  electromagnetic  problems  with  the  CF1SH 
program  while  at  the  same  time  using  the  E  and  H  fields  in  complementary  roles,  by  default  SFO  omits  Dirichlet 
boundaries  from  power  and  local  field  calculations,  unless  it  is  specifically  directed  to  include  such  boundaries.  This 
can  be  easily  done  by  setting  the  specific,  logical  control-variable  IncludeDirichletSegs  equal  to  Yes,  in  the 
SUPERFISH  initialization  file.  The  SFO  program  includes  four  different  options  for  specifying  the  surface 
resistance  of  conductors,  depending  on  whether  these  are  normal  conductors  or  superconductors,  and  whether  the 
default  values  for  copper  or  a  user-specified  resistivity  value  is  to  be  used.  Generally,  the  E  and  H  fields  are  used  in 
complementary  roles  to  find  the  solution  for  a  TE  cavity  mode,  in  which  case,  the  roles  of  £  and  H  are  interchanged. 
The  listed  values  of  the  £-field  in  the  SFO  output  file  actually  correspond  to  the  magnetic  field  H  of  the  TM  mode. 

5  -  SCATTERING  PARAMETER  EXTRACTION  AND  VALIDATION. 

The  complex,  multi-dimensional  scattering  matrices  S  of  multi-port  single  resonators,  and  that  of  resonator- 
clusters  of  relatively  moderate  extent  can  be  directly  computed  from  the  complex  field  data  generated  by  the  CFISH 
program.  This  can  be  done  by  first  interpolating  and  integrating  the  complex  E-  and  //-fields  along  the  input  and 
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output  interface-lines  between  the  considered  device  and  its  external  waveguides.  Both  the  E-  and  //-field  have  there 
constant  phase  if  the  selected  interface  line  is  coincident  with  the  phase-fronts  of  the  traveling  input-  or  output-wave 
at  the  considered  port.  The  phase-front  can  easily  be  identified  by  generating  phase-contour  displays,  as  discussed 
above. 

The  obtained  complex  integrals  represent  the  equivalent  port-voltages  and  equivalent  port-currents,  so  that  the 
complex  ratios  of  these  integrals  represent  equivalent,  local  complex  impedances.  Further,  the  real  parts  of  the 
voltage  x  current  products  represent  the  active  input  or  output  power  at  the  given  port,  while  the  imaginary  parts  of 
the  products  represent  the  reactive  power  being  exchanged  at  the  given  port  between  the  device  and  the  selected 
external  waveguide.  These  results  lead  to  the  computation  of  the  complex,  equivalent,  multi-dimensional  impedance 
matrix  Z  of  the  considered  device.  The  results  obtained  from  field-simulation  data  can  then  be  directly  compared  to 
those  of  previous  experimental  measurements,  performed  on  physical  models  of  single  cavity  resonators,  and  of 
resonator  clusters  of  moderate  extent  [5). 

An  essentially  infinite  number  of  different  multi-dimensional  scattering  matrices  S  of  multi-port  single 
resonators,  or  of  a  resonator-clusters  can  be  computed  from  the  obtained  unique  impedance  matrix  Z ,  depending 
on  the  selected  basis  of  impedance  normalization.  The  basis  of  the  normalization  could  in  principle  be  any  arbitrary 
complex  impedance  matrix  Z0 ,  of  the  same  size  as  the  impedance  matrix  Z ,  used  as  a  parameter  in  the  definition  of 
the  sets  of  in-going  and  out-going  waves,  to  and  from  some  given  multi-port  network.  In  common  microwave 
engineering  practice,  however,  only  two  choices  have  any  useful  physical  significance:  a)  a  real  diagonal  Z^-matrix 
with  all  its  main-diagonal  elements  equal  to  the  wave-impedances  Z0i  of  the  set  of  external  waveguides  used  in  the 
numerical  field  computation,  and  b)  the  auto-normalization  matrix  Z0}  ,  that  is  generally  complex,  and  block- 
diagonal  with  as  many  non-diagonal  blocks  as  there  are  separately-identified  sets  of  network-ports,  each  set  being 
considered  as  a  multi-port  interface  (l-2j.  The  scattering  matrix  S0  normalized  to  the  matrix  Z0  includes  the  wave 
reflection  coefficients  Si;  and  the  wave  transmission  factors  Sy  that  characterize  the  response  of  the  given  multi-port 
network  when  connected  to  external  waveguides  having  the  Z0i  wave-impedances.  Conversely,  the  scattering  matrix 
S,  normalized  to  the  matrix  Z0,  is  characterized  by  having  identically-zero  blocks  along  its  main  diagonal  and  off- 
diagonal  elements  Sy  that  characterize  the  response  of  the  given  multi-port  network  when  connected  to  a  set  of 
external  unconditionally-matched  load-networks.  The  scattering  matrices  S0  normalized  to  the  matrix  Z0  and  the 
scattering  matrix  S ,  normalized  to  the  matrix  Zm  are  mutually  related  by  way  of  the  scattering  matrix  re¬ 
normalization  transform  [30]. 

6  -  NETWORK  ANALYSIS  OF  LARGER  CLUSTERS. 

The  mathematical  problem  of  generating  closed-form  expressions  of  the  characteristic  Z  ,  ABCD  ,  S  or  T 
parameter-matrices  of  large  two-dimensional  clusters,  using  as  inputs  the  scattering  matrices  of  multi-port  unit-cell 
networks,  has  been  already  solved,  by  applying  advanced  methods  of  multi-dimensional  linear  algebra  [Ref.  5-8  and 
16-20],  and  methodically  exploiting  the  intrinsic  geometric  and  electromagnetic  symmetries  of  the  clusters.  Recursive 
solutions  are  readily  available  for  clusters  having  square  |17]  or  hexagonal  [6]  lattices,  and  for  two-dimensional 
structures  having  two  mutually-coupled  layer,  resembling  stacked-honeycombs  [8].  These  stacked-honeycomb,  two- 
dimensional  structures  include  multiple  meandering  wave-paths,  that  alternate  between  the  two  cavity-resonator 
clusters  of  the  top  and  bottom  layer  (Figure  4).  Such  structures  are  a  two-dimensional  generalization  of  the  well- 
known,  one-dimensional  side-coupled  cavity-resonator  structure  used  in  some  high-current  charged-particle  linear 
accelerators.  Extensive  graphic  visualizations  of  the  field-simulation  results  obtained  are  to  be  presented  in  session. 
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ABSTRACT 

In  recent  years  many  electromagnetic  (EM)  analysis  techniques  and 
computer  codes  have  been  developed  to  predict  radiation  patterns  of  various 
antenna  types  in  free  space  or  radome  effects.  However,  it  is  still  a  difficult  task  to 
predict  total  installed  performance  of  large  airborne  antenna  systems  due  to  lack 
of  efficient  computer  codes  for  both  airframe  interactions  and  radome  effects. 

Thus,  we  developed  an  extended  ray-optical  technique,  which  could  efficiently 
predict  total  installed  performance  of  large  airborne  antenna  systems,  i.e.,  the  near 
field  scattering  from  aircraft  structures  and  radome  effects.  The  new  analysis 
technique  utilizes  a  ray-tracing  method  in  the  radome  simulation  and  incorporates 
the  radome  effects  into  ray  optical  fields  in  the  airframe  scattering  analysis  code. 

Since  the  same  ray  technique  is  used  on  both  airframe  and  radome  simulations, 
the  new  analysis  code  can  efficiently  predict  total  installed  performance  of  large 
airborne  antenna  systems  with  arbitrary  shaped  radomes. 

1.0  INTRODUCTION 

Until  recently  most  airborne  antenna  systems  were  designed  and  tested  in  a  free  space 
environment  without  considering  the  near  field  scattering  effects  from  host  structures  and  radome 
effects.  However,  these  higher  order  effects  on  airborne  antenna  systems  are  the  matter  of  increasing 
concern  with  added  performance  demands  in  the  ever  increasing  jammer  and  clutter  interference 
environments.  Typical  effects  of  the  near  field  scattering  on  airborne  antenna  systems  are  radiation 
pattern  distortion,  reduced  gain,  increased  sidelobe  levels,  and  increased  clutter  returns  through 
multipath.  Some  of  the  radome  effects  are  insertion  loss,  beam  broadening,  boresight  error,  sidelobe 
degradation,  increased  cross-pol  levels,  etc.  These  higher  order  effects  associated  with  the  aircraft 
and  radome  structures  can  significantly  limit  the  capability  of  the  modem  airborne  radar  antenna 
systems  in  isolating  and  locating  targets. 

In  the  recent  years  many  electromagnetic  (EM)  analysis  techniques  and  computer  codes  have 
been  developed  to  predict  the  radiation  patterns  of  various  airborne  antennas  or  radome  effects. 
However,  there  are  still  no  efficient  computer  codes  that  can  accurately  predict  total  installed 
performance  of  large  antenna  systems  mounted  on  a  large  aircraft.  That  is,  some  analysis  codes  can 
predict  the  airframe  interactions  of  antenna  systems  but  are  lacking  the  radome  effects.  Other 
analysis  codes  can  predict  radome  effects  but  are  lacking  airframe  interactions.  Some  other  codes 
have  some  limited  capabilities  to  predict  both  airframe  interactions  and  radome  effects  but  do  not 
provide  the  computational  efficiency  desired  for  practical  airborne  antenna  designs.  Thus,  we 
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investigated  the  capabilities  and  limitations  of  currently  available  analysis  techniques  and  computer 
codes  for  the  installed  performance  of  practical  airborne  antenna  systems.  Then  we  developed  an 
extended  ray-optical  technique  that  bridges  the  gap  between  aircraft  scattering  analysis  codes  and 
radome  analysis  codes  for  the  total  installed  performance  (i.e.,  aircraft  interactions  and  radome 
effects)  of  large  airborne  antenna  systems. 

This  paper  will  show  a  brief  description  of  high  frequency  EM  analysis  techniques  used  for 
installed  performance,  an  extended  ray-optical  approach  for  both  the  radome  and  airframe  scattering 
effects,  and  practical  examples  of  an  airborne  antenna  system  on  a  large  aircraft. 

2.0  HIGH  FREQUENCY  EM  ANALYSIS  TECHNIQUES  FOR  INSTALLED  PERFORMANCE 

There  are  many  high  frequency  EM  analysis  tools  available  in  these  days  for  the  radiation 
prediction  of  antenna  systems  mounted  on  an  aircraft.  These  simulation  tools  generally  fall  under  the 
categories  of  Uniform  Geometrical  Theory  of  Diffraction  (UTD)  based  analysis  codes  or  Physical 
Theory  of  Diffraction  (PTD)  based  analysis  codes.  The  UTD  based  antenna  analysis  codes  include 
NEC/Basic  Scattering  Code  (BSC)  [1]  and  Aircraft  Code  [2].  The  PTD  based  antenna  analysis  codes 
includes  the  APATCH  code  [3]  that  also  utilizes  the  Shooting  and  Bouncing  Ray  (SBR)  technique. 
Most  of  radome  analysis  codes  are  based  on  either  ray-tracing  technique  or  Physical  Optics  (PO) 
technique.  Each  of  these  techniques  has  its  advantages  and  limitations,  depending  on  the  particular 
application. 

The  UTD,  which  is  used  in  this  paper  for  airframe  interactions,  is  a  ray  optical  technique. 
Figure  1  shows  various  first  order  UTD  terms  incorporated  into  one  of  the  UTD  codes  [2]. 


a)  DIRECT  SOURCE  FIELD 


d)  EDGE  DIFFRACTED  FIELD 


b)  SOURCE  FIELD  c)  REFLECTED  FIELD 


e)  CURVED  JUNCTION  EDGE  f)  CORNER  DIFFRACTED 

DIFFRACTED  FIELD  FIELD 


Figure  1.  Various  First  Order  UTD  Terms  For  Aircraft  Interactions 


The  total  radiated  field  is  the  superposition  of  the  following  UTD  field  components:  1)  direct 
field  from  the  source;  2)  curved  surface  diffracted  fields  from  the  fuselage  (referred  to  as  the  source 
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field  here);  3)  reflected  fields  from  wings  or  stabilizers;  4)  diffracted  fields  from  the  leading  or 
trailing  edges  of  the  wings  or  stabilizers,  including  diffraction  from  the  curved  junction  edges  formed 
by  the  intersection  of  the  wings  with  the  fuselage;  5)  and  vertex  diffraction  from  each  of  the  comers. 
In  addition  to  the  above  first-order  scattering  from  a  structure,  the  rays  reflected  or  diffracted  from 
one  structure  tend  to  interact  with  other  structures  causing  various  higher  order  UTD  terms  such  as 
double  reflected,  reflected-diffracted,  diffracted-reflected,  and  double  diffracted.  Thus,  the  total 
radiated  field  of  an  antenna/array  on  an  aircraft  structure  can  be  expressed  as  a  summation  of  the 
above  individual  UTD  terms.  Note  that  separation  of  these  terms  based  on  path  length  might  be 
important  for  accurately  modeling  the  temporal  dispersion  caused  by  the  aircraft  interactions. 


3.0  EXTENDED  RAY-OPTICAL  TECHNIQUE  FOR  TOTAL  INSTALLED  PERFORMANCE 

We  developed  an  extended  ray-optical  technique  that  combines  the  capabilities  of  both  an 
airframe  scattering  analysis  code  and  a  radome  analysis  code  for  the  total  installed  performance  of 
large  airborne  antenna  systems.  The  new  technique  can  use  either  analytic  surface  or  faceted  surface 
for  true  radome  geometry.  Accurate  modeling  of  the  radome  surface  is  very  important  in  EM  analysis 
since  most  radomes  are  in  the  proximity  of  the  antenna  aperture.  It  is  note  that  the  closer  any 
scattering  object  is  to  the  antenna  aperture,  the  bigger  its  effects  are  on  the  resulting  patterns.  The 
new  analysis  technique  utilizes  a  ray-tracing  method  in  the  radome  simulation  and  incorporates  the 
computed  radome  effects  into  ray  optical  fields  in  the  airframe  scattering  analysis  code.  Since  the 
same  ray  technique  is  used  for  both  aircraft  interactions  and  radome  simulations,  the  analysis  code  is 
computationally  veiy  efficient. 

Figure  2  shows  more  details  on  the  extended  ray-optical  process.  First  the  aircraft  analysis  code 
is  used  to  determine  the  geodesic  path  between  an  antenna  element  and  an  observation  point.  Next 
the  radome  code  is  used  to  find  an  intersection  point  and  corresponding  reflection/transmission 
coefficients  on  the  radome  wall.  Then  the  amplitude  and  phase  of  the  transmitted  ray  from  an 
antenna  element  are  modified  by  radome  transmission/reflection  coefficients  and  the  modified  ray  is 
used  as  an  incident  ray  at  the  diffraction  or  reflection  points  on  the  aircraft  structure. 


Radome  Wall 


Figure  2.  Extended  Ray-Optical  Technique  For  Installed  Performance  Of  Airborne 
Antenna  Systems  w/  Radome 


88 


4.0  PRACTICAL  EXAMPLES  OF  AN  AIRBORNE  ANTENNA  SYSTEM 


Figure  3  shows  the  configuration  of  a  typical  dorsal  mounted  airborne  antenna  system.  We  fully 
evaluated  the  total  installed  performance  (i.e.,  airframe  interactions  and  radome  effects)  of  the 
airborne  antenna  system  using  the  new  analysis  code  based  on  the  extended  ray-optical  technique. 
The  airborne  antenna  system  consists  of  an  ellipsoid  shaped  radome  and  an  aperture  inside  as  shown 
in  the  figure. 

First  we  evaluated  the  radome  effects  on  the  array  patterns.  Figure  4  shows  a  comparison  of  the 
elevation  plane  patterns  of  the  array  with  and  without  radome  surface.  The  comparison  clearly  shows 
the  effects  of  radome  such  as  insertion  loss,  mainbeam  broadening,  boresight  error  and  cross 
polarization,  etc.  Note  that  the  patterns  shown  in  Figure  4  include  only  the  direct  transmission 
through  the  radome  wall  and  do  not  include  the  reflection  effects  from  the  radome  wall. 


Figure  3.  Computer  Model  for  a  Dosal  Mounted  Airborne  Antenna  System 


Figure  4.  Elevation  Patterns  of  the  Array  w/  and  w/o  Direct  Transmission  through 
Radome  Surface. 
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Figure  5  shows  a  comparison  of  elevation  patterns  of  the  array  with  and  without  the  radome 
effects  including  both  direct  transmission  and  wall  reflection.  The  comparison  indicates  that  the 
radome  wall  reflection  effects  are  increased  sidelobes  between  445°  to  +70°angular  section.  It  is 
noted  that  radome  wall  reflection  depends  on  the  radome  shape  and  antenna  scan  angle.  Since  we 
used  the  -2°  elevation  scan  angle  of  the  array  in  this  simulation,  the  radome  wall  effects  in  upper 
angular  section  appeared  to  be  more  severe  than  in  the  lower  angular  section.  Figure  6  shows  the 
boresight  shift  on  main  beam  due  to  the  radome  surface.  The  boresight  shift  appears  to  be  -  0.24°  in 
this  case. 


Figure  5.  Elevation  Patterns  of  the  Array  w/  and  w/o  total  Radome  Effects. 


Without  Radome 


Figure  6.  Sectional  Volumetric  Patterns  in  Main  Beam  Region  with  and  without  Radome 


With  Radome 
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Figure  7  Computer  Model  of  an  Array  with  a  Radome  on  an  Aircraft. 


Figure  7  shows  the  configuration  of  a  computer  model  for  an  aperture  inside  a  radome  and  an 
aircraft.  First  we  evaluated  the  airframe  interactions  on  the  array  radiation  patterns  without 
considering  radome  effects.  Figure  8  shows  an  elevation  pattern  of  the  array  on  an  airframe  without  a 
radome.  It  is  noted  that  the  elevation  cut  is  a  great  circle  cut  through  the  mainbeam  that  is  10°  squint 
in  azimuth  direction.  The  elevation  plane  pattern  for  an  array  on  the  airframe  is  also  compared  with 
that  in  free  space  to  show  the  airframe  interaction  effects.  The  comparison  clearly  shows  that  the 
blockage  effect  by  the  wing  appears  in  the  -90°  to  +30°angular  section  and  the  sidelobe  degradation 
due  to  the  reflection  off  the  wing  appears  in  +30°  to  +90°angular  section.  The  comparison  shows  that 
the  main  beam  and  a  few  sidelobes  near  the  main  beam  are  not  affected  by  the  aircraft  interactions. 

Next  we  evaluated  the  total  installed  performance  of  the  array  that  includes  both  airframe 
interactions  and  radome  effects.  Figure  9  shows  an  elevation  pattern  of  the  array  with  a  radome  on  an 
airframe.  The  elevation  plane  pattern  for  an  array  on  the  airframe  is  also  compared  with  that  in  free 
space  to  show  the  airframe  interactions  and  radome  effects.  The  comparison  clearly  shows  that  the 
same  blockage  and  reflection  effects  of  the  airframe  in  the  ±90°  to  ±30°  angular  sections  and  radome 
effects  around  main  beam  region  including  insertion  loss,  mainbeam  broadening,  boresight  shift,  and 
sidelobe  degradation.  The  computation  time  for  the  elevation  patterns  in  Figures  8  and  9  was  1  hour 
and  3  minutes  for  the  array  with  a  radome  and  1  hour  for  the  array  without  a  radome  on  a  Sun 
workstation.  It  is  noted  that  the  array  used  in  this  calculation  consists  of  2432  radiating  elements. 
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Figure  8.  Elevation  Pattern  of  the  Array  on  an  Aircraft  without  Radome. 


Figure  9.  Elevation  Pattern  of  an  Array  with  a  Radome  on  an  Aircraft. 

5.0  CONCLUSIONS 

We  developed  an  extended  ray-optical  technique  that  bridges  the  gap  between  the  aircraft 
scattering  analysis  codes  and  the  radome  analysis  codes  for  the  total  installed  performance  of 
practica/airbome  radar  antenna  systems.  The  new  analysis  technique  uses  the  same  ray  technique  on 
both  airframe  and  radome  simulations.  Thus,  it  is  computationally  very  efficient  and  can  be  used  to 
analyze  total  installed  performance  of  a  large  antenna  system  on  a  large  airframe.  The  developed 
analysis  code  has  been  successfully  used  for  the  installed  performance  of  a  number  of  practical 
airborne  surveillance  radar  systems  and  its  accuracy  has  been  fully  verified  with  extensive  range 
measurements.  Typical  computation  time  required  for  the  radome  analysis  is  only  a  small  fraction 
(e.g.,  less  than  5%)  of  the  time  required  for  the  total  installed  performance  analysis. 
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ABSTRACT:  Three  antennas  are  used  as  examples  for  demonstrating  the  utility  antenna  imaging  and  visualization  for 
determining  where  on  the  structure  radiation  comes  from.  The  image  approach  is  bistatic  k-space.  This  analytic  technique 
uses  a  single  excitation  as  computed  from  any  frequency  domain  code  such  as  MOM. 

The  three  antennas  are  a  Yagi,  a  long  wire  end  fed  traveling  antenna,  and  a  dipole  fed  parabolic  dish.  Visualization  results 
include  one  and  two-dimensional  radiation  images,  near  field  distributions,  far  field  radiation  patterns,  and  current 
distributions.  The  electromagnetic  performance  was  computed  with  a  patch  method  of  moments  code. 

The  Yagi  images  show  which  of  the  elements  are  contributing  energy  to  the  far  field.  The  long  wire  traveling  wave 
antenna  images  show  that  radiation  is  produced  along  the  entire  length  of  the  wire  when  the  observer  is  located  at  the  peak 
of  the  forward  or  reflected  lobe.  When  the  observer  is  located  in  the  side  lobe  region  perpendicular  to  the  wire,  the 
radiation  sources  appear  at  the  end  regions  of  the  wire.  The  dipole  fed  parabolic  dish  antenna  images  show  the  relative 
levels  of  radiation  contributed  by  die  dipole  feed  and  the  reflector. 

ANTENNA  IMAGING:  Radar  cross  section  imaging  has  been  in  use  since  the  late  ‘70s.  It  has  been  instrumental  in  our 
understanding  of  scattering  centers  and  the  physical  mechanisms  contributing  to  radar  cross  section.  Experimental  imaging 
requires  short  pulse  or  wide  bandwidth  radiation.  This  is  applicable  for  scattering  but  not  for  typical  antenna  applications 
due  to  their  usually  limited  bandwidths.  This  has  greatly  reduced  our  ability  to  experimentally  image  antenna  radiation 
center  “hot  spots”  and  to  better  understand  the  physical  mechanisms  contributing  to  main  beam  and  side  lobe  radiation. 
Antenna  imaging  should  improve  our  understanding  not  only  of  simple  antennas,  but  more  importantly,  complex  antenna 
environments  on  air  vehicles,  ships,  and  satellites  where  host  body  /  antenna  interaction  affect  performance. 

The  k-space  image  technique,  however,  is  applicable  to  antennas  since  no  excitation  frequency  sweeps  are  required. 
Bistatic  k-space  imaging  uses  the  computed  current  distribution  at  a  single  frequency  to  obtain  an  image.  The  examples 
shown  in  this  paper  are  among  the  first  computational  high  resolution  antenna  images  known. 

VISUALIZATION  IN  COMPUTATIONAL  ELECTROMAGNETICS:  The  ultimate  goal  of  computational  modeling 
is  to  predict  measured  or  observed  phenomena.  For  many  years,  computational  scattering  or  antenna  EM  modeling  output 
consisted  of  a  backscatter  or  antenna  radiation  pattern  that  hopefiilly  matched  experimental  results.  In  this  mode, 
computational  modeling  did  little  to  aid  the  identification  and  understanding  of  the  physical  mechanisms  responsible  for 
observed  behavior.  This  limitation  was  unfortunate  because  computational  EM  codes  should  in  principle  be  able  to 
provide  a  rich  insight  to  the  scattering  or  radiation  process  if  we  just  ask  the  proper  questions  and  use  the  rapidly 
developing  graphical  display  capabilities  to  illustrate  the  basic  phenomena.  Our  EM  visualization  inspiration  is  similar  to 
efforts  in  computational  fluid  dynamics;  we  are  attempting  to  better  understand  the  interaction  of  an  EM  wave  or  local 
excitation  with  a  structure,  the  nature  of  body-body  interactions,  and  the  nature  of  the  resultant  radiation. 

Visualization  diagnostics  for  electromagnetics  involve  the  display  of  body  currents,  near  and  far  fields  and  radiation 
images.  Each  display  tells  us  something  different  about  the  radiation  process.  Taken  together,  these  diagnostics  produce  a 
more  complete  understanding. 

Currents  (or  effective  tangential  magnetic  or  electric  fields)  are  the  fundamental  quantities  which  produce  far  field 
radiation  patterns  and  are  often  the  computational  goal,  e.g.,  MOM  codes.  Knowledge  of  the  current  amplitude  and  phase 
distribution  on  a  geometric  structure  is  sufficient  to  compute  the  radiation  pattern.  Although  a  graphical  display  of  currents 
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indicates  the  spatial  amplitude  distribution,  it  does  not  tell  how  the  currents  radiate  in  different  directions  in  space,  which  is 
determined  by  the  phase  alone. 

Displays  of  near  field  E  and  H  indicate:  how  one  part  of  a  structure  interacts  with  another  part;  how  the  surface  currents 
and  charges  begin  to  form  the  eventual  far-field  radiation  pattern;  and  how  various  surface  current  modes  are  formed,  such 
as  surface  and  edge  traveling-wave  mechanisms. 

Imaging  is  yet  another  diagnostic  tool  for  examining  the  scattering/radiation  process.  An  image  is  not  a  picture  of  currents, 
but  of  the  far  field  radiation  produced  by  the  currents.  Images  differ  from  a  body  current  distribution  in  that  an  image 
shows  how  the  currents  radiate  collectively.  An  image  from  a  given  angle  tells  us  which  parts  of  the  current  distribution 
are  producing  radiation  in  that  direction. 

The  difference  between  images  and  far  field  radiation  patterns  is  that  the  far  field  pattern  is  the  phasor  sum  of  signals  from 
all  radiation  centers  on  the  body  which  radiate  in  the  observed  direction.  An  image  identifies  each  center  as  a  separate 
entity.  The  utility  of  images  results  from  the  ability  to  separate  and  identify  the  individual  radiation  centers  which  have 
collectively  summed  to  produce  the  net  result.  This  enables  us  to  understand  the  nature  of  each  radiation  center  and  also  to 
modify  it  in  ways  which  might  be  useful. 

Several  EM  visualization  computer  codes  are  now  beginning  to  appear.  Horn  [1]  at  NASA  Langley  has  developed  EM 
ANIMATE  for  use  on  Silicon  Graphics  workstations.  Marietta  Scientific,  Inc.  is  developing  EM  VIZ  [2],  a  PC  /  Windows 
based  visualization  program.  EM  VIZ  input  consists  of  points,  triangles,  and  currents  at  the  triangle  vertices.  This  data  can 
be  generated  by  any  frequency  domain  code  such  as  the  method  of  moments  or  physical  optics.  EM  VIZ  uses  this  current  / 
geometry  input  information  to  display  animated  currents  and  computed  near  fields,  bistatic  k-space  images,  and  far  field 
radiation  patterns.  The  results  presented  in  this  paper  used  EM  VIZ. 

The  results  presented  here  were  computed  using  MOM3D,  a  surface  patch  method  of  moments  code  [3]  whose  output  was 
formatted  for  input  to  EM  VIZ. 

Further  background  on  the  bistatic  k-space  image  technique  may  be  found  in  [4-5]  and  in  [6-7]  for  other  aspects  of 
visualization. 

BISTATIC  VTF.W  REGION:  Bistatic  k-space  images  are  obtained  with  one  current  distribution  by  computing  the 
bistatically  radiated  field  about  an  extended  region  centered  on  the  view  direction,  Figure  1(a).  This  viewing  region 
increases  as  the  user  specified  resolution  Ar  becomes  smaller.  The  k-space  fractional  bandwidth  and  resolution  are  related 
by  the  fundamental  relation  Ar  Ak  =  2n.  As  the  resolution  Ar  becomes  small,  Ak/k0  increases.  This  means  that  bistatic 
radiation  over  a  wider  angle  is  being  sampled  to  form  an  image,  Figure  1(b).  Resolutions  of  Ar  =  0.5  and  1.0  X  have 
fractional  bandwidths  Ak/ko  of  2ko  and  1  ko  which  correspond  to  view  angle  ranges  of  +/-  45°  and  26°  respectively  on 
either  side  of  the  nominal  receiver  direction. 

YAGI  ANTENNA:  The  Yagi  antenna  example  is  a  five-element  array  with  one  reflector  element,  one  driven  element,  and 
three  director  elements.  The  frequency  was  2  GHz  and  element  lengths  were  0.0850m,  0.0882m,  and  0.0674m 
respectively.  The  antenna  “wires”  were  modeled  as  flat  narrow  triangles.  The  patch  geometry  is  shown  as  inserts  in 
several  of  die  results  shown  in  Figure  2. 

The  E  and  H  plane  far  field  radiation  patterns  are  shown  in  Figures  2  (a)  and  (b).  The  maximum  gam  is  approximately  6.9 
dBi  and  the  front  to  back  ratio  is  approximately  8  dB.  The  “time”  snapshot  H  plane  near  fields,  as  computed  from  the 
current  solution  (and  resulting  surface  divergence  for  electric  charge)  is  shown  in  Figure  2(c).  The  forward  peak  radiated 
lobe  is  readily  seen  as  well  as  the  beginning  formation  of  the  upper  and  lower  side  lobes. 

Images  of  electrically  small  antennas,  i.e.,  those  whose  dimensions  are  near  the  image  resolution  limit,  are  difficult  because 
the  minimum  resolution  cell  is  not  much  smaller  than  the  antenna  itself.  This  Yagi  antenna  falls  into  this  category. 
However,  rectangular  widowing,  linear  scales,  and  zero  FFT  padding  can  alleviate  this  situation  somewhat. 

Bistatic  k-space  images  were  computed  with  the  receiver  oriented  on  the  main  beam  of  the  antenna.  The  specified 
resolution  was  0.25  X  so  that  the  bistatic  fractional  bandwidth  was  4k0  corresponding  to  a  bistatic-viewing  angle  of  +/- 
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180°.  Images  computations  used  a  rectangular  widow  function  to  maximize  resolution.  The  resulting  FFT  was  zero 
padded  for  smoothing.  The  image  amplitudes  are  shown  on  a  linear  scale  rather  than  log  scale  to  improve  sensitivity  of 
like  scatterers.  The  ID  and  2D  images  clearly  show  maximum  radiation  is  produced  by  the  driven  element,  as  one  might 
expect.  The  next  major  contributor  to  the  main  beam  are  the  currents  on  the  middle  director  element. 

We  did  not  try  to  optimize  this  antenna  for  gain  or  front  to  back  ratio.  However,  it  is  clear  that  the  visualizations  presented 
would  considerably  aid  an  engineer  engaged  in  the  optimization  process  by  illustrating  the  physical  results  obtained  by 
changes  in  configuration,  feed  points,  and  frequency. 

FIVE  LAMBDA  TRAVELING  WAVE  ANTENNA:  Our  next  sample  antenna  case  is  an  electrically  larger  antenna. 
The  well-known  end  fed  long  wire  has  its  main  radiation  lobes  near  the  axis  for  the  forward  and  reflected  traveling  waves 

on  the  wire.  Wire  length  and  wavelength  give  the  angular  peak  location  of  the  end  fire  radiation,  6  =  A9yJX/l  =  22° 
measured  from  grazing.  The  MOM  antenna  model  was  a  thin  triangular  strip  of  width  0.01  X. 

The  E  plane  axis  symmetric  radiation  pattern  is  shown  in  Figure  3(a).  The  feed  point  is  on  the  right  side  of  the  antenna 
insert.  The  main  traveling  wave  lobe  is  approximately  10  dBi  when  looking  into  the  feed  point  and  7  dBi  when  looking  at 
the  far  end  traveling  wave  reflection  point. 

The  near  electric  field  in  the  E  plane  is  shown  in  Figure  3(b)  where  the  four  end  fire  lobes  are  clearly  seen  along  with  the 
interference  pattern  produced  by  the  two  oppositely  directed  traveling  wave  currents. 

The  bistatic  image  of  this  antenna  was  computed  for  a  resolution  of  0.5  X  that  corresponds  to  a  k-space  fractional 
bandwidth  of  2ko  and  +/-  45°  bistatic  view  angle.  A  rectangular  FFT  data  window  function  was  utilized  to  maximize 
resolution  and  a  linear  scale  used  to  illustrate  the  resulting  image.  The  FFT  data  was  zero  padded  for  smoothing. 

The  first  2D  image,  Figure  3(c),  is  centered  on  the  main  forward  traveling  wave  lobe  so  that  the  feed  point  is  toward  the 
back  end  of  the  figure.  The  image  suggests  that  radiation  is  produced  along  the  entire  length  of  the  antenna  with  a  peak 
near  the  rear  feed  point.  At  this  angle,  the  radiation  from  all  points  on  the  traveling  wave  adds  in  phase  along  the  antenna 
length.  In  this  main  beam  direction,  the  path  length  phase  and  the  phase  of  the  traveling  wave  currents  constructively 
phase  add. 

The  second  2D  image,  Figure  3(d),  is  centered  on  the  reflected  traveling  wave  (TW)  lobe.  The  far  end  of  this  figure  is 
where  the  forward  TW  current  reflects  from  the  wire  end.  This  image  suggests  that  radiation  is  produced  along  the  entire 
length  of  the  reflected  TW  wire  currents,  with  a  peak  at  the  rear  termination.  The  radiation  image  amplitude  decreases  as 
the  TW  current  loses  energy  due  to  the  radiation  process.  A  linear  amplitude  scale  is  the  same  for  all  three  images  so  that 
the  reader  can  estimate  relative  strengths. 

The  third  2D  image.  Figure  3(e),  is  centered  perpendicular  to  the  wire  in  the  side  lobe  region.  This  radiation  image 
strongly  suggests  that  the  contributions  to  the  side  lobe  region  are  due  to  the  end  points  of  the  antenna.  The  side  lobes  are 
caused  by  the  phase  addition  and  subtraction  of  the  end  point  regions:  peaks  when  the  distances  to  each  end  phase  add; 
nulls  when  the  distances  phase  subtract. 

PARABOLIC  DISH  WITH  DIPOLE  FEED:  Our  last  sample  antenna,  Figure  4(a),  is  a  five  lambda  diameter  parabolic 
dish  antenna  with  a  focal  point  /  diameter  ratio,  F/D,  of  0.35.  The  dipole  feed  is  placed  at  the  focal  point.  The  focal  point 
was  chosen  to  be  an  odd  number  of  quarter  wavelengths  so  that  the  dish  reflected  radiation  would  phase  add  with  direct 
dipole  radiation.  No  attempt  was  made  to  place  a  reflector  behind  the  dipole  to  reduce  its  radiation  in  the  main  beam 
direction.  The  far  field  radiation  pattern,  Figure  4(b),  shows  a  dipole  like  pattern  with  slightly  higher  gain  on  the  reflector 
axis.  We  see  that  the  reflector  basically  converts  a  fraction  of  the  back  dipole  radiation  to  phase  add  with  the  front  dipole 
radiation.  A  “time”  snapshot  of  the  reflector  dish  currents  is  shown  in  Figure  4(c)  where  axis  symmetric  rings  of  constant 
phase  currents  are  seen.  The  E  plane  near  fields,  Figure  4(f),  shows  the  formation  of  the  main  beam. 

The  ID  and  2D  radiation  images,  Figure  4(d-e),  with  the  observer  on  the  parabolic  axis,  shows  two  major  contributors  to 
the  far  field  radiation.  The  dipole  feed  is  the  main  radiation  source.  The  reflector  dish  contributes  about  60%  of  the  dipole 
front  radiation.  This  suggests  that  the  reflector  dish  intercepts  about  60%  of  the  rear  dipole  radiated  energy  for  conversion 
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to  the  main  beam.  The  resolution  for  these  images  is  0.5  X  with  a  +/-  45°  bistatic  view  angle  to  either  side  of  the  image 
position.  Note  that  the  ID  image  scale  is  linear  while  the  2D  image  scale  is  log. 

SUMMARY:  The  utility  of  electromagnetic  visualization  for  antenna  problems  has  been  illustrated.  Visualization  clearly 
aids  in  showing  the  physical  processes  by  which  radiation  process  occurs. 
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Nominal  Bistatic  Look  Angle  as  a  function 
of  Resolution  in  Wavelengths 


Resolution  in  Wavelengths 


Figure  1b:  Bistatic  k-space  look  angle  versus  resolution 
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I  INTRODUCTION 

Since  their  first  published  analysis  (Cohen,  1995),  fractals  as  antenna  elements  have 
stimulated  considerable  investigation  (see,  for  example,  Cohen,  1998;  Puente  et.  al.,  1996; 
Valdivia;  1996).  As  with  all  new  investigations,  this  has  met  with  some  wrong  turns  and 
corrections  (see,  for  example,  Cohen,  1997).  Although  considerable  progress  on  the 
behavior  and  use  of  fractal  element  antennae  (FEA)  has  been  realized,  a  long-standing 
misconception  still  persists  regarding  their  ability  to  function  as  frequency  invariant  or 
frequency  independent  antennas.  Here  we  present  NEC4  simulations  of  simple  fractal 
dipoles  to  demonstrate  that  while  self-  similarity  brings  very  broad  bandwidth,  it  does  not 
produce  a  concomitant  invariance  with  power  pattern  and  gain. 

EL  MODEL  AND  SIMULATIONS 

In  Cohen  (1998)  the  VSWR  (and  hence  impedance)  characteristics  of  a  fractalized  dipole 
were  made  evident  by  the  application  of  a  Koch  triangle  fractal  generator  over  two 
successive  iterations  on  the  dipole.  At  wavelengths  where  the  antenna  was  electrically  large 
in  its  biggest  dimension,  the  Koch  dipole  became  extremely  broad  band,  showing  by 
example  that  a  deterministic  fractal  is  a  naturally  broad  band  antenna,  even  in  a  small 
number  of  iterations. 

Iterations  are  a  key  issue  with  fractal  antennae.  There  is  no  a  priori  reason  to  assume  that 
invariance  of  one  observable  is  emulated  by  other  observables  in  the  same  iteration.  Puente 
et  al.(1997)  have  used  analysis  of  fractal  arrays  to  argue  that  FEA  — when  taken  to  many 
iterations — are  also  frequency  independent  or,  alternatively,  have  discrete  bands  of 
invariance  (that  is,  frequency  invariance)  for  finite  iterations  (it  should  be  noted  that  the 
fractal  array  analyses  incorporate  geometric  symmetries  in  addition  to  fractal  geometry.) 
This  argument  is  used  to  explain  the  Sierpinski  monopole’s  multiband/frequency  invariant 
properties.  (Puente  et.al,1997). 

Even  with  a  perfect  antenna,  a  large  iteration  number  is  unattainable.  With  large  number  of 
iterations,  the  width  of  wire,  or,  alternatively,  etched  connection  width,  becomes  so  small 
that  ohmic  losses  dominate  the  impedance  and  gain  observables.  Hence  we  chose  to  model 
the  Koch  dipole  to  a  4th  iteration,  where  ohmic  losses  were  still  minimal  and,  like  the  4th 
iteration  Sierpinski  monopole  (Puente  et  aL,  1997)  the  finite  but  non-trivial  iteration  number 
may  be  expected  to  manifest  some  invariance  properties — should  they  be  a  property  of 
fractals. 
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The  model  is  shown  in  Figure  1.  The  length  in  the  X  dimension  is  lA  wave  at  65  MHz. 
Number  12  copper  wire  was  modeled  in  free  space.  The  feed  was  placed  at  the  center  with  a 
short,  symmetric  horizontal  wire  length  and  NEC4  was  used  to  accomplish  the  analysis  from 
500-2000  MHz.  Segment  densities  were  approximately  twice  that  of  conservative  limits  at 
each  frequency.  A  450  MHz  PC  platform  was  employed. 


Table  1 


Frequency  (MHz)  Immdance(ohms)  VSWR  (300  ohms )  Gain  (dBi) 


500 

311  +j4 

1.04 

5.4 

750 

329  -  j  1 15 

1.45 

0.6 

1000 

298  -  j  150 

1.65 

3.9 

1250 

374-J174 

1.74 

5.2 

1500 

195-J113 

1.87 

4.5 

1750 

594  -  j55 

1.88 

4.2 

2000 

354  -  i!32 

1.54 

6.5 

m.  RESULTS 

Figure  2  shows  the  stacked  comparison  of  the  current  distribution  amplitudes.  As  noted  in 
Cohen  (1998)  for  the  2nd  iteration  Koch  dipole,  the  current  distribution  ‘pulls  in’  with 
higher  frequency,  much  as  with  the  cells  on  a  log  periodic  array.  A  priori,  the  truncation  by 
iteration  number  would  produce  less  fine  structure  in  the  wire  bending  for  which  these 
frequencies  are  exposed,  and  hence  the  biggest  deviation  in  characteristics  from  the  lower 
frequencies. 

However,  Table  1  reveals  that  while  the  VSWR  remains  reasonably  invariant  across  the 
range,  the  gains  vary  by  6  dB.  There  is  no  simple  dependence  with  increasing  frequency. 
The  gain  is  not  invariant  with,  or  independent  of  frequency. 


Figure  3  shows  two  representative  azimuthal  power  patterns  across  the  frequency  range. 
The  patterns  bare  little  resemblance  to  each  other.  Again,  the  pattern  is  not  invariant  with,  or 
independent  of  frequency. 

IV.  DISCUSSION 

Since  only  one  counterexample  illustrates  the  lack  of  validity  of  a  generalized  statement,  we 
must  conclude  that  frequency  invariant  and  independent  properties  are  not  inherent  to 
fractals  as  antenna  elements. 

It  thus  seems  salient  to  explore  what  additional  geometric  constraints  are  necessary  to 
achieve  the  effect.  Rumsey’s  Principle  (Rumsey,1966)  connotes  both  self-complementanty 
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and  a  constant  size/sparing  ratio,  as  with  the  case  of  log  periodic  antennae.  Impedance 
a  s  ow  chan8e  of  impedance  is  achievable  from  fractal  designs  so 

“■Rissssr,"  - — — —  *  v 

In5inFlffUr^4  ^  ’ lS  3  3  lttrf°n  K°Ch  dip°le  (again  1/2  wave  “  X  at  65  MHz)  which  is 

Figure  4(b)  is  “■ symmetric  *• 

niustrates  tte  frequency  behavior  for  VSWR  and  gain  for  these  antennae  Neither  is 
truly  frequency  independent.  However,  the  origin  symmetric  version  has  an  excursion  of 
only  about  0.4  dB  from  about  1000-2000  MHz.  This,  in  mo*  practical  cir^n*  a 

? *°“  aPP™*™*1™  to  frequency  independence.  The  azimuthal  pattern  is  not  invariant  but 
v™  °f  ab°Ut  25  degreeS  f°r  ^  main  lobe  across  frequency  ra ZX 

FurtWif!?  ^’*871110161110  VT10n  ^  ,3rge  gain  excursions  over  the  frequency  rlnge 
Furthennore  it  expenences  orthogonal  reversals  in  the  main  lobe  angle  over  the  frequency 

range.  It  does  not  approximate  frequency  independent  or  frequency  invariant  propertied.  Y 

It  is  important  to  state  that  the  monopole  Sierpinski  fractal  element  studied  by  Puente  et  al 
w  ongm  symmetric  with  respect  to  its  mirror  image  Thus  it  would  appear  that  fractal  design 
wfrh  on^n  symmetry  may  have  some  utility  in  describing  and"  producing  frequent 
invariant/independent  antennae.  It  has  yet  to  be  shown  that  these  two  characteristics  are 
uniquely  acceptable  for  mciusion  within  Rumsey’s  principle  or  some  new  corollary  of  it. 


Table  2 


Origin  Symmetric 


X-Axis  Symmetric 


Freg.fMHz )  VSWR  (300  ohms)  Grain  (dm 


500  2.0  3.6 

750  2.2  5  9 

1000  1.8  5.1 

1250  1.7  4.7 

1500  1.4  49 

1750  1.3  4  g 

2000 _ L7 _  4.7 


Freq.(MHz )  VSWR  (300  ohms)  Gain  CdRi) 


500  1.8  3.8 

750  1.6  6.3 

1000  1.9  5.6 

1250  1.9  3.6 

1500  1.2  3  2 

1750  1.2  4.5 

2000 _ L6 _ 3j 
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V.  CONCLUSION 


FEA  are  known  to  have  wideband  impedance  behavior  when  electrically  large.  However 
this  characteristic,  or  multiband  (that  is  impedance  invariant)  behavior,  does  not  necessarily 
imply  invariance  of  gain  and  power  pattern.  We  have  shown  that  frequency 
invariant/independent  behavior  is  not  uniquely  attributable  to  a  fractal  design.  Additional 
geometric  constraints  dictate  this  possibility.  Origin  symmetry  may  be  used  to  produce  some 
FEA  which  have  useful  frequency  invariant  or  frequency  independent  characteristics. 
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Abstract: 

This  paper  discusses  the  continuing  development  of  antenna  training  aids  at  the  Australian  Army’s  Army 
Engineering  Agency  (AEA)  for  the  Army’s  School  of  Signals,  and  the  Navy’s  HMAS  Cerberus  Frequency 
Management  School.  The  training  aids  rely  heavily  on  the  ability  to  produce  high-quality  rendered  images  of 
antenna  radiation  patterns  using  electromagnetic  visualisation  techniques  developed  at  AEA.  Data  compression 
techniques  developed  for  storage  of  antenna  patterns  permit  the  display  of  conventional  radiation  pattern  graphs 
and  together  with  a  limited  ionospheric  modelling  capability,  of  footprints  of  over-the-horizon  communications 
parameters.  Adjustment  of  parameters  such  as  frequency,  antenna  height,  slope,  configuration,  ground  type  and 
viewing  angle  control  the  selection  and  display  of  the  appropriate  antenna  pattern  image.  Interactive  operation 
coupled  with  the  ability  to  view  patterns  either  as  a  rendered  3D  image,  a  graph,  or  a  footprint  provides  excellent 
insight  into  the  intricacies  of  antenna  radiation  patterns.  This  paper  updates  and  consolidates  a  number  of  the 
author’s  previous  papers  on  antenna  training  aids  and  related  subjects. 

Background: 

Reliable  communications  depend  on  the  best  use  of  the  available  equipment,  and  effective  exploitation  of 
the  communications  medium.  While  this  is  true  of  any  communications  method,  communication  in  the  high 
frequency  (HF)  band,  from  2  MHz  to  30  MHz  requires  attention  to  matters  considered  by  many  to  fall  into  the  realm 
of  Ihe  black  arts'.  Both  the  radio  equipment  and  their  antennas  are  generally  required  to  operate  over  a  wider 
range  of  frequencies  (15:1)  than  equipment  operating  in  other  bands.  As  ionospheric  conditions  change  during  the 
day,  corresponding  frequency  changes  are  required  to  maintain  communications  availability  at  an  acceptable  level. 

For  some  antennas,  particularly  wide  band  antennas,  the  radiation  pattern  shape  can  vary  significantly 
with  frequency,  and  other  parameters.  While  the  radio  equipment  -  the  transmitter  and  receiver  -  may  function  well 
over  the  required  band,  and  even  match  the  antenna  system,  unless  due  attention  is  paid  to  the  choice  of  antenna, 
its  configuration,  siting  and  orientation,  the  shape  of  the  radiation  pattern  may  make  communications  difficult,  if  not 
impossible.  The  use  of  low  transmitter  power  to  conserve  battery  life  or  for  covert  operation  compounds  the 
problem. 

Proper  antenna  training  will  promote  better  understanding  and  exploitation  of  antenna  radiation  patterns, 
leading  to  improved  communications  availability  and  reliability.  High  performance  personal  computers,  with  high- 
capacity  hard  drives  are  becoming  more  readily  available  both  in  the  training  environment  and  in  the  field  and  can 
provide  useful  platforms  for  which  visual  antenna  training  aids  can  be  developed. 

initial  work  at  AEA: 

In  1991,  AEA  obtained  Version  2  of  the  Lawrence  Livermore  Laboratory’s  Numerical  Electromagnetic 
Code  (NEC)1  for  near  field  assessment.  This  was  interfaced  to  AutoCAD®  by  procedures  written  in  AutoLISP®,  a 
programming  language  embedded  in  AutoCAD.  These  interfaces  provided  an  environment  for  the  creation  and 
examination  of  NEC  input  files  in  a  graphical  context  (Reference  1). 

The  interfaces  were  extended  to  plot  contours  and  colour  maps  of  near  field  data  and  to  present  far-field 
radiation  pattern  data  as  three  dimensional  meshes  (figure  1).  Using  the  SlideShow  capability  of  AutoCAD, 
screen-dumps  were  saved  as  .SLD  files,  and  by  linking  these  images  together  with  a  script  file,  animations  of  field 
patterns  were  created.  Although  these  ran  fairly  slowly,  they  showed  considerable  promise  as  a  means  of 
conveying  an  understanding  of  pattern  characteristics.  An  upgrade  to  AutoCAD  Version  12  allowed  creation  of 
rendered  images  of  patterns  (figure  2).  The  images  were  more  ‘realistic’,  and  despite  the  minimal  edge  treatment 
of  faces,  further  demonstrated  the  potential  of  these  techniques. 


1  Distributed  as  part  of  NEEDS  2.0  through  the  Applied  Computational  Electromagnetics  Society  (ACES),  CA,  USA 
®AutoCAD  and  AutoLISP  are  registered  trademarks  of  AutoDesk  Inc. 
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The  rendering  and  animation  package  3D  Studio®  Version  2  was  obtained  and  interfaces  written  to  directly  read 
NEC  output  files.  Using  this  capability  ‘photographic1  quality  images  of  3D  radiation  patterns  could  be  readily 
created  either  as  still  images  or  animation  files.  This  work  is  discussed  in  References  2  and  3.  Using  the  inherent 
capabilities  of  3D  Studio,  animated  frequency  and  other  scales,  backgrounds  and  other  objects  can  be  created  to 
annotate  the  images.  Careful  attention  to  lighting,  shadowing,  camera  positioning,  the  use  of  fog  to  suggest 
distance,  choice  of  appropriate  images,  colour  and  textures  for  backgrounds,  and  for  the  representation  of  the 
earth  and  of  the  antenna  pattern  itself,  can  all  provide  subtle  clues  which  greatly  enhance  the  perception  and 
comprehension  of  the  pattern  (see  figures  3  and  4).  Creation  of  good  images  is  not  a  process  to  be  taken  lightly, 
but  requires  considerable  practice  to  develop  and  maintain  the  appropriate  skills. _ 


Figure  1 :  Mesh  radiation  pattern  image  (AutoCAD) 


Figure  3: 
(3D  Studio) 


Rendered  radiation  pattern  image 


Figure  2:  Rendered  radiation  pattern  image 

(AutoCAD) 


Figure  4:  Rendered  radiation  pattern  image  in  a  scene 
(3D  Studio) 


Creation  of  an  animation  or  a  series  of  images  in  3D  Studio  requires  the  assignment  of  keys  to  the  objects 
to  be  animated.  These  modify  the  position,  rotation  shape,  scale  and  other  parameters  of  the  objects,  lights  and 
cameras,  thus  creating  the  frame-by-frame  changes  for  the  animation.  In  3D  Studio  Version  2,  the  keys  were 
created  by  hand.  In  one  instance  over  11000  keys  were  created  to  produce  an  animation  of  a  HF  radiation  pattern 
sweeping  from  2  to  30  MHz,  and  viewed  from  64  different  viewpoints  -  a  multi-dimensional  set  of  radiation  pattern 
images.  Creation  of  the  keys  alone  took  just  over  three  weeks  but  yielded  an  excellent  result.  Later  work  using  3D 
Studio  Version  4  used  programs  written  in  the  internal  language  ‘Keyscripf,  reducing  the  key  creation  time  to  less 
than  one  minute  (Reference  4). 


^30  Studio  is  a  registered  trademark  of  AutoDesk  Inc. 


108 


The  image  output  from  3D  Studio  can  be  saved  as  easily  as  an  animation  file,  a  series  of  still  images  or  to 
videotape.  The  training  aids  use  still  images  and  the  Joint  Photographic  Experts  Group  (.JPG)  format  easily  gave 
the  best  image  quality  for  a  given  file  size,  allowing  almost  60000  images  to  be  stored  on  a  single  compact  disk 
(CD).  By  careful  choice  of  compression  parameters,  image  format  and  other  options,  this  number  could  be 
extended  to  over  80000. 

An  antenna-specific  viewer  is  written  to  display  the  image  in  response  to  selection  of  frequency,  antenna 
geometry  (height,  angle,  configuration  etc),  ground  type,  viewing  angle  and  other  parameters.  The  user  can  then 
display  the  appropriate  image  in  terms  of  antenna  parameters,  rather  than  a  file  name.  The  viewers  for  different 
antennas  are  adapted  from  a  generic  viewer  by  the  addition  of  controls  which  are  specific  to  that  antenna. 

Considerable  effort  has  recently  been  spent  on  reducing  the  manual  effort  needed  to  produce  the  large 
number  of  images  needed  for  the  training  aids.  To  simplify  multiple  modelling  runs,  modifications  have  been  made 
to  NEC  to  allow  passing  of  input  and  output  filenames  as  command  line  arguments.  In  addition  a  program  was 
written  to  read  the  NEC  input  file  and  directly  create  an  input  file  for  the  NEC  Sommerfeld/Norton  ground  modelling 
program  SOMNEC.  These  changes  considerably  enhance  batch  mode  running  of  NEC  2. 

Image  creation  process: 

Keeping  track  of  the  large  number  of  files  involved  with  the  image  creation  process  demands  a  structured 
approach  to  file  names.  File  names  used  throughout  the  image  creation  process  are  generic,  and  are  derived  as 
follows:.  The  first  two  characters  are  a  unique  two-character  designator  (BD)  referring  to  the  antenna  name 
(Bidirectional  Delta  in  this  case),  The  next  two  digits  (nn)  form  a  unique  number  assigned  to  each  lowest  level 
directory.  The  remaining  two  digits  (mm)  refer  to  the  ordinal  position  of  the  file  in  the  set.  Other  characters 
identify  file  types.  Thus  BD01,I00  is  the  first  NEC  input  file  in  directory  1  for  a  Bidirectional  Delta  antenna. 

The  process  commences  with  the  creation  of  purpose-written  program  (BDBUILD.EXE)  based  on  the 
antenna  geometry  and  the  range  of  variants.  This  program  builds  a  directory  structure  based  on  the  parameters  to 
be  user-adjustable  in  the  final  training  aid.  Typically,  these  would  include  ground  type  and  the  range  of  different 
antenna  configurations.  At  the  same  time  an  Index  File  (BD.NDX)  is  written  in  the  root  directory.  This  lists  generic 
file  names  and  corresponding  directory  paths  and  is  used  to  control  much  of  the  subsequent  processing.  For  each 
directory  NEC  input  files  (BDnn.Imm)  are  written,  together  with  a  NEC  batch  file  (BDnn.BAT)  to  control  the 
running  of  NEC  and  SOMNEC  within  that  directory.  One  input  file  is  generally  used  for  each  frequency  so  that 
segmentation  can  be  optimised.  Finally,  a  Master  batch  file  (BD.BAT)  is  written  in  the  root  directory  to  call 
individual  NEC  batch  files  and  run  the  whole  modelling  process. 

The  production  of  a  typical  training  aid  can  require  the  creation  and  running  of  up  to  5000  separate  NEC 
input  files.  Once  the  creation  program  is  written,  the  processes  of  directory  and  file  creation  typically  take  less 
than  ten  minutes. 

The  Master  batch  file  then  calls  SOMNEC  and  NEC  for  each  of  the  input  files  in  each  directory  to  write 
corresponding  output  files  (BDnn.Omm).  For  simple  wire  antennas  (typically  up  to  about  20  wires),  the  5000 
modelling  runs  take  about  3  hours  using  a  200  MHz  Pentium-S  CPU.  More  complex  structures  such  as  antennas 
on  vehicles  take  correspondingly  longer  times. 

Once  all  the  modelling  runs  are  completed,  post-processing  is  initiated.  NEC  output  files  are  read  using 
the  Index  file  for  navigation,  resulting  in  the  creation  of  the  following  files  for  each  directory: 

•  Three-dimensional  Drawing  exchange  Format  (BDnn.DXF)  files  of  radiation  pattern  shapes  used  to 
input  the  pattern  data  to  3D  Studio, 

•  files  containing  data  on  the  maximum  envelopes  of  a  sets  of  radiation  pattern  data  used  to  calculate 
camera  and  lighting  positions  (BDnn.ENV), 

•  BDnn.VUE  files,  used  to  control  object,  light  and  camera  positioning  during  rendering  by  3D  Studio, 

At  the  same  time  files  containing  pattern  data  in  tabular  form  (BDnn.ANT),  files  which  contain  summary 
data  and  any  NEC  error  messages  (BDnn.SUM),  and  files  of  compressed  antenna  pattern  data  (BDnn.DC3)  are 
automatically  written  to  each  lowest  level  directory. 

The.DXF  files  are  manually  loaded  into  3D  Studio  and  merged  with  a  previously  created  file  (BASIS.3DS) 
containing  common  elements  such  as  a  dummy  radiation  pattern,  earth,  sky,  lights  and  a  camera.  The  resulting 
scene  is  then  saved  (BD01.3DS).  An  Options  file  (BDnn.OPT)  is  also  created  to  control  various  image 
parameters  and  a  Rendering  batch  file  (BDREND.BAT)  is  written  in  the  root  directory  to  create  the  images  by  3D 
Studio  rendering  of  the  scenes. 
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1  To  other  processes  1 
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Figure  5:  Training  aid  creation  process  for  Bidirectional  Delta  antenna.  3D  boxes  indicate  that  the  same 
structures  that  are  repeated  across  multiple  directories  for  different  antenna  configurations. 
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The  Rendering  bach  file  is  then  run  to  call  3D  Studio  and  pass  control  to  the  appropriate  .VUE  and  .OPT 
file  to  create  the  .JPG  image  files.  For  each  rendering  run  within  a  directory  3D  Studio  automatically  generates 
file  names  of  the  form  BDnnffff.JPG,  where  ffff  is  the  frame  number  in  the  series  of  images.  For  the  Bidirectional 
Delta,  493  images  were  created  per  directory  corresponding  to  29  frequencies  and  17  viewpoints.  Once  rendering 
is  initiated  it  runs  unattended  for  the  whole  suite  of  directories.  The  rendering  time  for  each  image  varies  with 
image  content,  but  averages  23  seconds,  a  run  of  50000  images  taking  some  13  days  to  complete. 

In  parallel  with  the  image  generation  an  antenna-specific  viewer  is  written.  This  provides  the  means  for 
the  end  user  to  navigate  through  the  large  number  of  rendered  radiation  pattern  images  in  electromagnetic  terms, 
rather  than  by  using  file  names.  In  addition,  it  has  interfaces  to  other  methods  of  antenna  pattern  display,  which 
are  covered  later  in  this  paper. 

The  creation  of  training  aids  using  rendered  images  of  radiation  patterns  relies  heavily  on  the  ability  to 
rapidly  create  and  process  large  numbers  of  antenna  models.  The  above  processes  of  image  generation  could,  in 
theory,  be  adapted  to  virtually  any  electromagnetic  engine.  However,  without  the  ability  to  generate  input  files  and 
process  output  files  using  external  applications,  the  time  needed  to  manually  perform  these  operations  would  be 
excessive,  and  the  process  would  not  be  viable.  Such  processing  requires  a  detailed  knowledge  of  the  associated 
file  formats,  and  reticence  on  the  part  of  some  software  suppliers  to  release  this  information  prevents  their 
modelling  engines  from  being  used  in  this  way.  NEC  has  well-documented  input  and  output  file  formats,  and 
together  with  the  modifications  for  batch  mode  operation,  it  is  well  suited  to  this  application. 

Antenna  pattern  compression: 

Concurrent  with  the  development  of  the  image-based  training  aids,  data  compression  methods  for  antenna 
patterns  were  being  developed.  Modelling  of  a  number  of  in-service  antennas  indicated  that  to  capture  the 
structure  of  patterns,  particularly  for  those  antennas  which  were  large  or  at  significant  heights  above  the  ground,  a 
minimum  resolution  of  5°  in  azimuth  and  elevation  was  required.  This  is  the  same  resolution  as  used  in  the  above 
image  generation  process.  For  the  upper  hemisphere  of  the  pattern,  this  requires  1368  points  per  frequency.  In 
many  cases,  for  adequate  frequency  resolution  across  the  HF  band,  1  MHz  steps  have  been  found  necessary, 
yielding  pattern  data  files  of  about  160  kByte.  The  data  compression  methods  of  Reference  5  were  developed  in 
order  to  efficiently  embed  this  amount  of  data  in  an  application.  Although  the  compression  is  lossy,  the  errors 
introduced  during  compression  and  decompression  are  restricted  to  the  low  gain  areas  (nulls)  of  the  radiation 
pattern  where  they  are  of  little  importance  to  the  intended  application.  Both  full-hemisphere  skywave  as  well  as 
groundwave  patterns  are  stored  using  the  AEA  developed  algorithms  which  compress  the  total  data  to  under  700 
bytes  per  frequency.  In  addition  the  decompression  process  intrinsically  provides  interpolation  between  the 
original  data  points. 

The  pattern  file  also  contains  a  header  with  a  variety  of  antenna-related  parameters.  These  include: 

•  the  antenna  name,  plus  7  supplementary  text  fields, 

•  a  unique  antenna  identifying  number, 

•  the  antenna  model  version, 

•  the  boresite  bearing  of  the  antenna  (usually  0°), 

•  a  ‘directional  antenna’  flag,  and 

•  the  antenna  frequency  limits. 

Sufficient  data  to  display  a  3D  drawing  of  the  antenna  is  also  stored,  and  this  can  accommodate  up  to  128 
wires,  30  masts,  8  sources  and  8  loads.  The  header  information  increases  the  file  size  by  about  1.4  kByte, 
resulting  in  a  final  file  size  for  an  antenna  specified  at  1  MHz  steps  from  2  to  30  MHz  of  under  22kByte. 

Using  this  compressed  file  format,  displays  of  horizontal  and  conical  cuts  of  the  antenna  pattern  can 
readily  be  produced.  The  conical  cut  is  taken  at  a  user-specified  takeoff  angle,  and  is  more  appropriate  in  the 
context  of  skywave  communications  than  the  traditional  horizontal  cut. 

If  the  pattern  data  is  combined  with  an  ionospheric  modelling  capability,  antenna  performance  can  be 
displayed  as  footprints  of  over-the-horizon  parameters  which  are  dependent  on  both  the  ionosphere  and  the 
antenna  pattern.  These  include  operational  frequencies  such  as  the  maximum  useable  frequency  (MUF),  optimum 
working  frequency  (OWF)  and  lowest  usable  frequency  (LUF).  Other  antenna-related  useful  parameters  that  can 
be  displayed  include  are  path  length,  number  of  hops,  takeoff  angle,  to  and  from  bearings,  propagation  mode, 
antenna  gain,  probability  of  intercept,  and  signal  to  noise  ratio.  The  range  of  plots  is  limited  only  by  the  range  of 
parameters  available  from  the  embedded  ionospheric  modelling  techniques. 
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Current  work: 

Although  the  three  methods  of  antenna  pattern  display  described  above  were  developed  independently, 
work  is  proceeding  on  the  creation  of  a  single  training  aid,  which  embraces  the  three  display  types  together  with 
other  antenna  information  as  separate  pages  of  a  tabbed  notebook.  To  summarise,  the  three  display  types  are: 

•  Rendered  images:  These  give  an  easily  understood  overall  impression  of  the  shape  of  the  pattern.  The 
ability  to  view  the  pattern  from  different  viewpoints,  and  to  observe  the  effects  of  changes  of  frequency, 
ground  type  and  antenna  configuration  provide  an  environment  where  the  shape  of  the  pattern  can  be 
readily  conceptualised. 

•  Graphical  display:  This  displays  the  pattern  data  using  traditional  graphical  methods  and  can  also 
provide  numerical  data.  A  vertical  cut  and  a  conical  cut  at  a  user-selected  takeoff  angle  are  displayed. 

•  Map  display:  This  shows  the  effects  of  the  interaction  between  the  antenna  pattern  and  user-selected 
ionospheric  conditions,  displaying  area  coverage  footprints  of  the  antenna  and  other  communications 
data. 

Other  pages  provide  additional  antenna  as  follows: 


•  a  photograph  of  the  erected  antenna, 

•  a  line  drawing  based  on  data  from  the  existing  antenna  handbook  and  showing  antenna  components 
and  connections, 


•  textual  information  about  the  antenna  components,  related  documentation,  and  operational 
parameters. 

Figures  6  to  8  show  screens  show  screens  from  the  three  pattern  display  capabilities  which  are  to  be 
embedded  in  a  single  training  aid. 
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Figure  6:  Radiation  pattern  rendered  image  display 
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Figure  7:  Antenna  gain  graphical  display. 


Figure  8:  Antenna  gain  as  an  area  coverage  map. 


113 


Documentation: 

AEA  quality  procedures  require  that  software-based  products  for  use  outside  the  Agency  be  documented 
in  accordance  with  an  approved  software  specification,  originally  MIL-STD-498.  Such  documentation  has  been 
prepared  for  much  of  the  training  aid,  and  currently  comprises  the  Software  Development  Plan  (SDP),  Software 
Design  Description  (SDD),  Software  Product  Specification  (SPS),  System/Subsystem  Specification  (SSS), 
Software  Test  Plan  (STP),  Software  Test  Report  (STR),  and  the  Software  User  Manual  (SUM),  together  with  a  100 
page  work  instruction  on  the  image  creation  process  and  related  issues.  These  documents  were  compiled  during 
the  development  of  the  various  stand-alone  capabilities  of  the  training  aid  for  the  RAVEN  Bidirectional  Delta 
antenna,  and  require  to  be  merged  into  a  coherent  whole.  However  as  the  functionality  of  the  training  aids  is 
different  for  different  antennas,  updating  and  extension  will  be  required  as  additional  training  aids  are  produced. 

Antenna  data: 

In  order  for  the  above  training  aids  and  related  applications  to  embrace  further  antennas,  accurate 
geometric  data  on  antennas  in  use  will  be  required  to  create  the  electromagnetic  models  from  which  the  gain 
patterns  are  computed.  In  the  case  of  fixed  installations  this  is  generally  not  a  great  problem,  as  the  antenna 
geometry  and  that  of  the  surroundings  is  fixed.  However,  in  the  case  of  field  antennas,  which  are  often  expedient, 
obtaining  adequate  data  on  such  antennas  and  their  range  of  variants  presents  considerable  difficulty.  While 
antenna  handbooks  give  broad-brush  descriptions  of  antenna  dimensions,  in  practice  these  are  often  not  adhered 
to.  Wisely  applied,  changing  element  slopes  or  adding  a  reflector  or  counterpoise  can  improve  communications 
performance.  In  such  a  case  the  ‘antenna  intuition’  which  the  training  aids  aim  to  develop,  can  be  of  considerable 
benefit.  But  in  order  to  provide  the  maximum  usefulness,  geometric  data  on  a  wider  range  of  antennas  and 
variants  needs  to  be  captured  so  that  they  can  be  modelled  and  embedded  in  the  applications.  Previous 
experience  suggests  that  attending  Signals  Unit  exercises  with  a  steel  tape,  theodolite  camera  and  a  keen  eye 
might  be  the  only  way  to  obtain  this  urgently  needed  data. 

Conclusions: 

Aspects  of  the  evolution  of  an  antenna  training  aid  capability  developed  by  AEA  for  the  Australian  Army’s 
School  of  Signals  and  HMAS  Cerberus  Frequency  Management  School  has  been  presented.  Strategies  have 
been  discussed  for  the  efficient  production  of  high-quality  antenna  radiation  pattern  images  and  of  the 
compression  of  antenna  pattern  data  based  on  antenna  modelling  using  NEC.  These  techniques  have  been  used 
to  create  to  self-contained,  CD-based  antenna  training  aids  which  are  easy  to  operate  and  provide  excellent 
insight  into  the  complexities  of  antenna  radiation  patterns.  The  best  of  all  worlds  is  provided  by  the  ability  to  view 
the  same  pattern  as  a  rendered  3D  image,  graphically,  or  as  footprints  of  pattern  related  data.  To  extend  the 
usefulness  of  the  applications,  antenna  geometric  data  covering  the  full  range  of  antennas  in  use,  particularly 
expedient  antennas,  needs  to  be  captured  so  that  the  antennas  can  be  modelled.  The  training  aids  find 
application  not  only  in  the  classroom,  but  with  the  increasing  availability  of  field  computing,  could  extend  into  the 
tactical  area,  providing  guidance  in  the  field  on  antenna  choice,  configuration  and  siting.  Properly  applied,  this  wtl 
lead  to  an  improvement  in  communications  availability  and  the  development  of  an  ‘antenna  intuition  at  a  level 
previously  obtained  only  after  may  years  of  field  experience. 
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Extension  of  the  Method  of  Lines  for  Planar  3D  Structures. 

Yuriy  0.  Shlepnev 
Eagleware  Corporation 
4772  Stone  Drive,  Tucker,  GA,  30084,  USA 
Tel.  770-939-0156,  fax  770-939-0157 
E-mail:  yuriy@eagleware.com 

Abstract.  The  Method  of  Lines  (MoL)  is  extended  for  planar  3D  structures  with  conductivity 
currents  flowing  across  media  layers.  The  extended  MoL  is  applicable  for  the  analysis  of 
discontinuities  in  planar  (M)MIC’s  lines  with  vias  and  other  connectors  between  signal  layers.  It 
also  makes  it  possible  to  analyze  discontinuities  taking  into  account  metallization  thickness.  An 
admittance  matrix  of  a  layer  with  currents  across  it  in  a  grid  spectral  domain  is  presented  as  the  basic 
element  of  the  algorithm.  The  extended  MoL  is  illustrated  by  analysis  of  a  via-hole  grounded 
microstrip  line. 

Introduction.  The  method  of  partial  discretization  or  method  of  lines  is  the  well-established 
technique  for  the  analysis  of  different  microwave  structures.  It  can  be  successfully  used  for  the 
analysis  of  2.5D  (M)MIC  discontinuities  [1-4].  The  one  theoretical  restriction  of  2.5D  technique  is 
absence  of  conductivity  currents  inside  a  layered  structure  between  signal  layers.  The  signal  or 
metallization  layers  are  placed  on  the  interfaces  of  the  media  layers  and  are  supposed  to  have  zero 
thickness.  The  MoL  was  recently  extended  for  the  analysis  of  some  4-port  3D  MMIC  structures  [5], 
but  applications  of  this  expansion  are  still  limited. 

This  paper  proposes  a  technique  to  extend  the  2.5D  technique  developed  for  the  analysis  of 
discontinuities  in  planar  lines  [1-4]  on  planar  3D  structures.  The  technique  keeps  2D  discretization 
and  spectral  representation  for  homogeneous  media  layers  as  for  usual  2.5D  MoL  and  introduces  a 
model  of  a  media  layer  with  constant  conductivity  currents  across  the  layer.  A  combination  of  the 
signal  layers  with  2D  currents,  layers  with  the  vertical  currents,  and  the  simple  homogeneous  media 
layers  makes  it  possible  to  perform  a  kind  of  3D  discretization  for  some  regions  of  a  (M)MIC 
discontinuity.  The  classification  planar  3D  emphasizes  the  fact  that  vertical  planes  in  those 
structures  have  infinitesimally  small  conductivity  current  components  other  then  vertical.  An 
analytical  semi-discrete  solution  for  homogeneous  media  layers  as  well  as  for  the  layers  with  the 
vertical  currents  transfers  the  advantages  of  the  MoL  on  the  planar  3D  (M)MIC  structures.  The 
metallization  thickness,  vias,  and  different  kinds  of  bridges  can  be  simulated  in  this  way. 

The  described  technique  is  implemented  for  one  signal  layer  in  a  program  for  electromagnetic 
simulation  of  (M)MIC,  =EMPOWER=,  developed  at  Eagleware  Corporation  (www.eagleware.com). 
Geometrical  symmetry  processing,  the  super-grid  technique  and  the  prime  factor  algorithm  for  the 
discrete  Fourier  transformation  have  been  used  in  the  program  to  expand  capabilities  and  to 
accelerate  the  base  MoL  procedures.  The  generalized  scattering  parameters  of  a  discontinuity  are 
extracted  by  the  method  of  simultaneous  diagonalization  [6,7].  A  regular  version  6.5  A  of  the 
program  is  used  to  generate  examples  for  this  paper. 

Problem  Formulation.  Let’s  consider  a  rectangular  domain  Q  in  the  Cartesian  axes: 

O  =  {(x,y,z;  |  o  <  x  <  a,  o  <  y  <  b,  d0  <  z  <  dP} 

The  domain  has  P  media  layers  with  the  different  or  the  same  properties:  _ 

e(x,y,zl  =  ep,  n(x,y,z;  =  np,  ('x,y,z;eQp,  Qp  =  ^x,y,z;  1 0  <  x  <  a,0  <  y  <  b,dp_i  <  z  <  dp}p  =  1,P. 
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Here  £p, Up  are  absolute  dielectric  permittivity  and  permeability  of  the  layer  p.  The  domain  is 
bounded  by  the  electric  or  magnetic  walls.  The  media  layers  are  homogeneous  and  can  be  lossy.  A 
layer  can  have  constant  conductivity  current  across  it  with  only  one  component  jzof  the  current 

volume  density  vector.  The  electric  (EP)  and  magnetic  (HP  )  fields  are  related  by  the  Maxwell’s 
equations  inside  a  layer  p  for  a  harmonic  signal  with  the  radian  frequency  co: 


rotE  =-i<onpH  (x,y,z)€np  (1) 

divEP  =  0,  divHP  =  0 

Conductivity  currents  with  x  and  y  components  can  exist  only  in  signal  layers  z  =  dj  parallel  to  the 
medium  layer  interfaces.  If  Uj  is  surface  conductivity  current  density,  the  generalized  boundary 
conditions  for  the  signal  layers  are: 

z  x  fHP+1  (dj )  -  HP  (dj  )1  =  r|  j 

,  '  ii  =  l,P-l  (2) 


To  complete  the  problem  formulation  for  a  (M)MIC  structure,  additional  boundary  conditions  on 
perfect  or  lossy  metal  surfaces  and  in  resistive  film  regions  must  be  defined.  To  model  external 
inputs  of  the  structure  and  lumped  element  connections,  auxiliary  port  regions  are  introduced  into 
the  problem  domain.  The  desired  solution  of  the  electromagnetic  problem  is  an  immitance  matrix 
relating  the  integral  voltages  and  currents  in  all  port  regions.  After  connection  of  the  lumped 
elements  to  the  corresponding  ports,  the  immitance  matrix  can  be  transformed  into  a  generalized  Y- 
or  S-matrix  using  the  simultaneous  diagonalization  method  [7]  or  by  other  means. 

Thus  we  have  the  problem  formulation  that  is  appropriate  for  a  wide  range  of  (M)MIC  devices. 
Notice  that  the  problem  domain  is  not  restricted  by  planar  structures,  because  most  of  the  three 
dimensional  objects  with  negligible  x-  and  y-directed  currents  on  the  mantle  can  be  approximated  by 
a  set  of  the  media  layers  with  z-directed  currents. 

Mapping  on  Grid.  To  approximate  the  problem  on  a  grid,  let’s  define  grid  variables  instead  of 
continuous  vector  functions.  It  is  convenient  to  define  the  grid  analogues  of  electric  and  magnetic 
fields  inside  a  layer  p  as  elements  of  matrices: 


Positions  of  the  grid  functions  correspond  to  the  x-y  plane  projection  of  the  offset  finite  difference 
scheme  [8j.  Grid  currents  can  be  defined  as  integrals  of  the  surface  current  components,  and 
constant  grid  currents  across  a  layer  can  be  defined  as  surface  integrals  of  the  current  volume  density 
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jz  near  the  grid  nodes: 

(4) 

Jy  =  / 2/w ny k ! / 2My . di K 1  =  >X rn  =  KM+lj,  i  =  IP^ 

dP_,  <z<dP,p=ip 

Having  defined  the  grid  equivalent  of  the  problem  functions,  we  can  approximate  Maxwell’s 
equations  (1)  by  a  system  of  differential-difference  equations  in  accordance  with  the  x-y  plane 
projection  of  the  finite-difference  scheme  [8].  The  continuity  conditions  (2)  on  the  grid  are 
approximated  by  the  same  conditions,  but  only  on  the  grid  nodes.  The  offset  difference  scheme 
approximates  the  initial  system  with  the  second  order  inside  a  layer.  The  global  approximation  order 
of  a  problem  is  usually  lower  because  of  possible  singularities  of  the  field  and  current  components 
on  the  metal  edges. 


Grid  Spectral  Domain  Solution.  Let’s  define  a  Grid  Green’s  Function  (GGF)  as  a  solution  of 
the  differential-difference  analogue  of  the  Maxwell’s  equations  excited  by  a  unit  grid  current  (Jx,Jy 
or  Jz).  This  response  function  is  a  discrete  function  in  the  x-y  plane  and  continuous  inside  a  layer 
along  the  z-axis.  To  solve  the  formulated  problem  we  need  a  contraction  of  the  GGF  to  the  signal 
planes  and  to  the  regions  with  non-zero  z-directed  currents.  This  contraction  is  a  matrix  due  to  the 
discretization. 

To  find  the  GGF  matrix,  a  spectral  approach  is  used  that  is  similar  to  the  spectral  domain 
technique.  Instead  of  continuous  TE  and  TM  rectangular  waveguide  eigenwaves,  their  grid 
analogues  are  used  as  a  basis  to  expand  the  electromagnetic  fields  inside  a  layer.  The  number  of  TE 
and  TM  waves  is  finite  and  their  system  is  complete.  This  means  that  instead  of  summation  of 
series,  as  in  the  spectra!  domain  approach,  we  have  finite  sums,  that  in  combination  with  the  grid 
propagation  constants  of  the  eigenwaves  provides  monotonic  convergence  of  the  algorithm.  So, 
transformation  of  the  grid  functions  (3,4)  to  the  grid  TE  and  TM  wave  basis  gives  us  new  variables: 
(ex/hy/3x}=FMex/hy/Jx)-Fy, 

(ey  /hx  / 3y)'Fx|  (ey  /hx  /Jy)-F^2  ,  (5) 

(®z  /  jz) =  Fx,  •  (ez  /  Jz)  -  Fyi 
^z)=  Fx2  •  (hz)- Fy2 

and  converts  the  differential-difference  system  of  equations  to  a  set  of  systems  of  ordinary 
differential  equations.  Fx1j2  and  Fylj2  in  (5)  are  transformation  matrices.  Symbol  1  is  a  matrix 
transposition  symbol.  A  solution  of  the  system  for  a  layer  number  p  with  z-directed  currents  and 
with  undefined  conditions  at  the  layer  boundaries  z  =  dp  and  z  =  dp.,  can  be  written  as  admittance 


matrices  for  each  pair  of  the  grid  TE  and  TM  eigenwaves: 
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Here  !  =  l,  L  + 1,  m  =  l,  M  + 1 ,  and  uz  are  voltages  across  the  layer  in  the  transformed  domain.  In  the 
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initial  domain  it  can  be  defined  as  u£  =  ^_{e^(z)dz .  Equations  for  the  Y-matrix  elements  are  given 
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(7) 


(r!Pm)2  =  kP  -  Z|2m<  kg  =  »VP,  *?m  =  Yxl  +  Yym  - 
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tp  =  dp  -dp_|, 


Y-matrices  for  media  layers  without  z-directed  currents  can  be  deduced  from  (6)  by  eliminating  the 
row  and  column  corresponding  to  the  z-directed  components,  and  eliminating  the  last  summand  in 
the  equations  for  the  matrix  elements  (7).  So,  we  have  (L+1)(M+1)-1  five-diagonal  symmetrical  Y- 
matrices  (6)  for  each  media  layer  that  can  be  used  to  superimpose  the  boundary  conditions  at  the 
interfaces  (2)  in  the  transformed  domain.  To  find  the  GGF  matrix,  a  unit  current  in  the  spectral 
domain  can  be  defined.  A  reply  on  the  current  excitation  is  tangential  local  voltages  at  some  layer 
interfaces  and  voltages  across  layers  with  z-directed  currents  in  the  transformed  domain.  The  replies 
can  be  found  independently  for  each  pair  of  the  grid  TE  and  TM  waves  by  simple  connection  of  Y- 
matrices  (6).  To  formalize  this  procedure,  a  conceptual  multi-port  corresponding  to  the  system  (6) 
can  be  introduced.  An  aggregate  of  these  solutions  for  each  possible  excitation  is  the  grid  spectral 
domain  solution  of  the  problem. 

Transforming  the  grid  spectral  domain  solution  back  to  the  spatial  domain,  we  get  the  GGF 
matrix.  Finally,  expressions  for  each  element  of  the  matrix  look  like  double  sums.  These  sums  can 
be  reduced  to  sums  of  four  elements  of  an  auxiliary  vector  with  dimension  6*(L+1)*(M+1)  per  one 
signal  layer.  The  vector  elements  depend  only  on  the  structure  of  the  package  and  grid  cell  size.  Its 
elements  are  calculated  via  discrete  Fourier  transforms.  Before  filling,  the  rank  of  the  GGF  matrix 
can  be  decreased  by  means  of  a  super-grid  technique  and  by  implementing  the  geometrical 
symmetry  of  the  problem.  The  super-grid  technique  gives  a  pseudo-non-equidistant  grid  over  the 
regular  grid  that  is  finer  near  metallization  edges  and  comers  and  coarser  inside  solid  regions.  It 
overcomes  the  restrictions  of  the  MoL  with  a  regular  grid  while  keeping  the  main  advantages  of  the 
equidistant  grid.  The  composed  GGF  matrix  are  to  be  transformed  to  an  immitance  matrix  (Y-  or  Z- 
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matrix)  relating  the  grid  currents  and  voltages  in  the  input  source  regions.  It  can  be  done  by  direct 
inversion,  factorization  or  by  an  iterative  procedure. 

Numerical  Example.  As  a  test  example,  we  consider  the  analysis  of  a  circular  via-hole  grounded 
microstrip  structure  described  in  [5]  and  shown  in  Fig  1 .  The  analyzed  structure  has  the  following 
parameters:  tl  =  0.794  mm,  el  =  2.32,  t2  =  23  mm,  e2  =  1,  a  =  5.175mm,  b  =  6.9mm,  w=2.3  mm, 
d=0.6  mm.  The  circular  cross-section  of  the  via-hole  is  approximated  on  the  grid  by  a  staircase 
boundary,  and  the  mantle  of  the  via-hole  cylinder  is  simulated  by  a  set  of  z-directed  currents.  A 
regular  version  6.5  A  of  the  =EMPOWER=  program  and  a  Pentium  200  MHz  computer  with  64  MB 
were  used  to  simulate  the  via-hole.  Fig.  2  shows  the  calculated  magnitude  of  the  transmission 
coefficient.  It  took  about  2  minutes  per  frequency  point  and  9  MB  of  computer  memory  to  solve  the 
problem  on  the  grid  with  cell  size  0.0575  mm  by  0.0575  mm.  The  result  obtained  by  the  hybrid 
finite-difference  and  finite-element  time-domain  method  in  [9]  is  also  depicted  in  the  Fig.  2  for 
comparison.  A  small  constant  bias  in  the  results  can  be  explained  by  differences  in  S-parameter 
extraction  techniques. 

Conclusion.  This  paper  presents  the  base  elements  for  the  extension  of  the  MoL  on  planar 
structures  with  vertical  or  z-directed  currents.  The  admittance  matrix,  relating  the  grid  analogues  of 
the  tangential  electric  and  magnetic  field  components  at  the  opposite  surfaces  of  a  layer,  z-directed 
current?  and  integral  of  z-directed  electric  field  inside  the  layer  in  the  transformed  domain,  is  given 
as  the  kernel  element  of  the  algorithm.  Other  parts  of  the  numerical  procedure  are  given 
schematically,  because  they  are  in  compliance  with  the  planar  version  of  the  MoL  and  the  spectral 
domain  technique.  To  author’s  knowledge,  the  MoL  was  not  developed  in  this  way,  despite  the  fact 
that  the  similar  integral  equations  techniques  were  extended  for  the  structures  with  the  vertical 
connections  [10-12]. 
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Fig.  1  A  circular  via-hole  grounded  microstrip  line  in  a  shielding  box.  Hatched  areas  near  the 
sidewalls  are  regions  of  auxiliary  ports  to  simulate  inputs  of  the  structure. 


2000  1.1  e+0  2e+04 

Frequency,  MHz 

Fig.  2  Magnitude  of  the  transmission  coefficient  of  the  via-hole  grounded  microstrip  with  the 
following  parameters:  tl  =  0.794  mm,  el  =  2.32,  t2  =  23  mm,  s2  =  1,  a  =  5.175mm,  b  =  6.9mm, 
w=2.3  mm,  d=0.6  mm.  Solid  line  -  this  method,  x  -  results  from  [9], 
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Noise  Issues  in  Submicron  Design 


Reinaido  Perez 
Jet  Propulsion  Laboratory 
California  Institute  of  Technology 


Abstract 


Often  in  Electromagnetic  Compatibility  the  main  concern  rest  on  the  effects  of  noise 
current  at  the  system,  subsystem,  and  even  circuit  level.  However,  with  the  increasing 
trend  of  even  more  smaller  devices,  the  issue  of  noise  is  becoming  also  of  importance  at 
the  component  level.  The  main  issues  concerning  noise  at  the  component  level  are 
addressed.  A  review  of  tools  that  could  help  address  noise  problem  at  the  device  level  are 
also  address. 

Introduction. 


It  has  been  well  known  for  many  years  that  analog  circuits  are  very  susceptible  to  noise 
problems  and  interference  in  general.  Noise  current  as  small  as  a  few  mA  (and  even  micro¬ 
amps)  can  cause  significant  degradation  in  analog  circuits.  For  a  while  digital  designers  did 
not  have  to  concern  themselves  with  interference  and  noise  at  the  digital  circuit  levels 
because  digital  circuits  were  highly  regarded  as  immune  to  noise  problems.  That 
understanding  started  to  change  as  mix-mode  design  came  into  the  picture  (i.e  mixed 
design  of  analog  and  digital  circuits  in  electronic  boards).  The  art  of  electromagnetic 
compatibility  experience  a  tremendous  boost  in  the  1980’s  and  early  1990’s  as  the  proper 
design  of  mix-mode  printed  circuit  boards  to  avoid  or  mitigate  interference  problems 
became  a  prevalent  issue.  The  use  of  new  design  techniques,  modeling  tools,  signal 
integrity  tools  became  common  in  the  language  of  the  PCB  industry  and  a  hot  topic  in 
conference,  journal  papers,  and  short  courses. 

Though  interference  and  noise  problems  are  still  a  important  issue  in  electronic  design  at 
the  subsystem  and  circuit  levels,  noise  is  also  starting  to  become  an  issue  at  the  submicron 
design  of  components.  Noise  has  become  a  problem  because  as  design  rules  reach  the  0.25 
pm  level  and  below,  interconnect  becomes  an  increasing  dominant  factor  in  the  design. 
Technology  scaling  is  causing  coupling  capacitance  between  on-chip  signals  to  become  a 
larger  portion  of  the  total  capacitance  within  the  IC.  At  the  same  time  because  transistor 
cross-sections  are  shrinking  and  the  number  of  interconnects  is  increasing,  the  quantity  of 
interconnect  related  capacitances  is  becoming  a  much  larger  portion  of  the  total  gate 
capacitances.  The  end  result  is  within  an  IC  certain  voltages  are  going  from  high-to  low, 
from  Vdd  to  ground,  from  ground  to  Vdd  in  a  “soup”  of  capacitive  parasitic  effects. 
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Noisy  Component  ICs. 


With  the  new  trends  in  technology  scaling,  the  more  aggressive  use  of  higher  clock 
speeds,  the  jamming  together  of  interconnect  wires,  the  decrease  of  threshold  voltages,  are 
all  conspiring  together  such  that  the  noise  immunity  of  digital  circuits  is  not  longer 
sufficient  to  protect  them  against  their  own  inherent  noise.  As  circuit  densities  continue  to 
increase,  interconnect  capacitance  will  continue  to  increase  becoming  an  increasing 
fraction  of  the  total  capacitance  of  the  device.  The  old  problems  related  with 
interconnections  between  devices  on  a  PCB  (signal  integrity  issues)  are  now  becoming 
also  a  problem  in  the  IC  themselves.  The  microscopic  metallic  slivers  that  interconnect  the 
devices  internally  are  now  starting  to  behave  as  transmission  lines  themselves.  There  has 
been  a  dramatic  change  is  aspect  ratio  (the  shrinking  width  of  interconnects  with  respect 
to  height).  Furthermore,  the  lines  are  also  much  closer  together,  often  closer  than  the  line 
thickness  themselves  an  in  many  cases  the  separation  is  less  than  the  separation  between 
lines  on  different  layers.  Therefore,  line-to-line  capacitance  on  the  same  layer  has  become 
dominant  [1-3]. 

In  PCB  the  layer-to-layer  interaction  could  be  the  dominant  factor,  signals  would  rarely 
interact  in  a  way  that  reduce  current  into  the  metal  line.  However,  as  the  iine-to-line 
capacitance  increases  it  is  very  common  to  find  coupling  problems  between  individual 
signals  where  one  line  acts  as  the  noise  source  capable  of  inducing  noise  current  on  a 
susceptible  line.  One  solution  that  has  been  bee  tried  in  the  past  to  decrease  line-to-line 
coupling  is  to  increase  the  spacing  between  lines.  Capacitive  effects  between  lines  can  also 
be  reduced  by  re-routing  the  lines.  The  use  of  lower  dielectric  constant  materials  between 
troublesome  lines  could  also  be  tried.  Noise  can  also  affect  switching  circuits  by  altering 
delays  in  the  propagation  paths  and  causing  timing  inaccuracy.  If  the  noise  source  line 
induces  a  current  into  a  susceptible  circuit  while  the  susceptible  circuit  is  changing  states, 
the  extra  current  into  a  susceptible  circuit  can  hinder  the  change.  Noise  coupled  between 
metal  lines  on  the  interconnect  can  also  move  from  a  switching  line  to  a  static  line,  or  even 
into  the  power  supply.  In  Figure  1  the  static  voltage  drop  across  package  and  chip 
resistance  reduces  the  supply  voltage  available  to  the  circuitry.  The  change  in  ground 
reference  due  to  changes  in  ground  current  further  reduces  the  voltage  available  to  the 
circuit. 
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PACKAGE  ON  CHIP 


DC  IR  Drop:  Variations  in  the  DC  power  supply  levels  on-chip  due  to  IR  drops  in  the 
Power/Ground  distribution 

Ground  Bounce:  Simultaneous  switching  of  off-chip  drivers  and  internal  circuits 
causes  periodic  variations  in  the  supply  and  ground  rails  due  to  inductance  on 
the  chip  and  package  supply  and  ground  wires. 

Figure  1.0  Illustration  of  Ground  Bounce. 


In  the  past  in  order  to  get  more  noise  immunity  under  control  “rule-based  approaches” 
have  been  implemented.  These  rule-based  approaches  however,  are  constrained  in  that 
they  don’t  apply  to  all  cases;  therefore,  more  rules  need  to  be  added  all  the  time  to 
incorporate  ever  more  increasing  complex  cases.  The  second  approach  is  a  better 
established  approach  and  is  based  on  static  noise  analysis.  Static  noise  analysis  asks 
whether  a  design  will  work  in  the  presence  of  noise.  It  calculates  the  noise  appearing  on 
every  net  and,  in  the  worst  case  scenario,  how  much  noise  can  appear  on  the  net. 
Furthermore,  static  timing  analysis  evaluates  the  noise  immunity  of  every  net,  flagging 
those  where  there  are  potential  failures.  Static  analysis  involves  looking  at  the  circuits 
interconnects  and  analyzes  the  entire  design.  High  frequency  effects  need  to  be  analyzed 
using  tools  that  just  go  beyond  static  timing  analysis.  These  effects  include  inductance  and 
resistance  in  signal/signal  return  lines.  In  Figure  2  capacitive  coupling  and  charge  sharing 
can  move  a  gate  voltage  beyond  a  threshold,  causing  that  gate  to  make  an  unwanted 
transition. 
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Charge-charing  noise 
at  node  D  causes  node 
“0”  to  change  state 


(2c) 

Figure  2.  Illustration  of  Capacitive  coupling,  charge  sharing,  causing  unwanted  transitions. 


Floorplanning  Guides 

In  rule-based  design  approach  for  IC  design,  floorplanning  a  chip  of  moderate  size  is  not  a 
difficult  task.  The  following  recommendations  are  easy  to  implement  and  can  help  greatly 

1)  Develop  a  wiring  plan,  such  that  as  the  horizontal  metal  1/vertical  metal  -2  scheme 
channel  routers  is  used.  Think  about  how  best  the  layers  can  be  exploited  to  make 
connections.  Develop  consistent  schemes  to  assign  layers  to  wires.  Employing  different 
layers  for  different  wire  directions  or  different  types  of  nets  will  you  the  best  approaches. 


2)  Include  small  components  into  larger  ones.  Many  block  diagrams  include  isolated  gates 
or  larger  components.  These  small  components  increase  the  design  complexities  since  they 
require  extra  effort  for  power  and  ground  routing  and  can  disrupt  the  flow  of  wiring 
across  the  chip.  Either  place  them  in  an  existing  larger  block  or  create  a  glue  logic  for  all 
of  these  loose  components. 

3)  Design  simple  wiring.  Move  first  the  blocks  then  the  pin  locations. 
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4)  Design  planar  wiring  as  much  as  you  can.  The  most  important  signals  should  have  a 
planar  routing  and  then  we  can  add  less  critical  signals  later. 

5)  Make  separate  wiring  plans  for  power  and  clock  signals. 

The  effects  of  charge  sharing  can  be  reduced  by  changes  in  circuit  topologies  [2]  as 
shown  in  Figure  3 


Beta  noise  can  be  tuned  to  trade  off 
one  noise  margin  against  the  other 


(3a) 
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DYNAMIC  LOGIC  GATE 


Increase  size  of 


Discussion  on  Tools: 

The  following  table  illustrate  a  list  of  tools  (some  are  relevent  to  computational 
electromagnetics)  which  provide  some  help  (and  the  help  is  described)  in  the  analysis  of 
submicron  IC  desing  for  assessing  noise  related  problems.  The  conclusion  from  this  table 
shows  that  in  reality  a  variety  of  tools  may  need  to  be  implemented  to  accomplish  the 
desired  results. 


Type  of  Tool 

Usefullness 

How  to  use  it 

Evaluation 

Integral  Equations 
(eg  MOM) 

Extremely  limited 

Capacitive  effects 
only 

Very  intensive  for 
very  little 
advantages 

Finite  Difference  & 
Finite  Time  Domain 

Limited 

Capacitive  effects 
mainly 

Very  intensive  for 
very  little 
advantages 

Transmission  Line 
Methods 

More  suitable.  Use 
widely  by 
commercial  signal 
integrity/EMI  tools 

As  its  name 
indicates,  can  easily  : 
model  High  Freq. 
Effects  of 
transmission  lines 

Efficient  in  terms  of 
computational  time, 
memory,  accuracy 
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Static/Quasi  Static 
Methods 

More  suitable.  The 
prefered  method  in 
CAD  tools  for  PCB 
and  IC  design 
(parasitic  extraction) 

Can  easily  model  all 
low  frequency 
transmission  line 
effects 

Efficient  in  terms  of 
computational  time, 
memory.  Limited 
accuracy  at  higher 
frequencies 

Lumped  Parameter 
Methods 

Suitable.  The 
prefered  method  in 
SPICE-Iike  tools  for 
circuit  analysis 

Suitable  only  at  low 
frequencies 

Efficient  bu  of 
limited  accuracy 

Charge  Transport 
Methods 

Very  suitable.  Under 
consideration  for 
labor  intensive  jobs 

Can  easily  and 
accurately  model 
capacitive,  resistive 
effects 

Very  computer 
intensive 

Table  1.  The  role  of  several  types  of  computational  tools  in  modeling  IC  parasitic  effects. 
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Abstract 

In  this  paper,  a  numerical  solution  of  the  scattering  of  electromagnetic  plane  waves  from  a  multi¬ 
layered  eccentric  cylinder  at  oblique  incidence  is  discussed  briefly.  The  cylinders  are  infinitely  long, 
penetrable,  and  of  circular  cross  sections.  Numerical  data  for  the  back-scattering  and  extinction  cross 
sections  are  presented  and  discussed  for  two  independent  polarizations  of  the  incident  plane  wave  (TM 
and  TE).  The  results  indicate  that  the  scattered  fields  by  an  eccentric  cylinder  are  sensitive  to  the 
eccentricity  parameters,  cylinder’s  material,  and  the  tilt  angle  of  the  incident  plane  wave. 


1.  Introduction 

The  scattering  of  electromagnetic  plane  waves  from  an  eccentric  multi-layered  cylinder  at  oblique 
incidence  was  investigated  analytically.  The  analytical  solution  was  obtained  by  solving  the  wave 
equation  in  different  spatial  regions  after  expressing  the  field  components  in  cylindrical  wave  functions 
and  applying  the  boundary  conditions  at  the  boundaries  [1].  Since  the  cylinders  were  eccentric,  the 
translational  addition  theorem  of  Bessel  functions  was  used  to  represent  the  field  components  in  terms 
of  the  local  coordinates  of  each  individual  cylinder  before  applying  the  boundary  condition  on  the 
surface  of  each  cylinder.  A  system  of  coupled  equations  with  unknown  expansion  coefficients  resulted 
and  has  been  solved  numerically.  In  this  study,  we  present  various  new  numerical  results  showing  the 
electromagnetic  properties  of  the  eccentric  cylinders. 


2.  Geometry  of  the  problem 

Figure  1  shows  M  (M  >2)  eccentric  cylinders  of  radii  q  (j=l,..,M).  Each  cylinder  is  defined  by  its  own 
cylindrical  coordinate  system,  i.e.,  (pj.<!>],z] )  and  the  axes  of  the  cylinders  are  parallel  to  the  z-axis 

and  are  separated  by  a  distance  d } . 
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Region  0 


Figure  1.  Eccentric  cylinders  of  radii  rj  0=1,.., M).  The  incident  plane  wave  makes  an  angle  6  0 
with  the  negative  z-axes  and  its  projection  on  the  x-y  plane  makes  an  angle  <f)0  with  the  negative 
x-axis. 
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The  vector  d}  is  directed  from  the  axis  of  cylinder  j  to  the  axis  of  cylinder  j+1  and  makes  an  angle  yj 
with  the  positive  x-axis.  We  will  consider  no-overlapping  cylinders,  i.e.;  the  cylinder  of  radius  rj+i  is 
always  inside  that  of  radius  rj.  Region  0  (j=0),  the  space  outside  the  outer  cylinder  (A  >  ^ is  a 
lossless  dielectric  with  o0  =  0 .  Region  1  (j=l)  is  the  space  occupied  by  the  region  between  the  outer 
cylinder  and  the  next  one  to  it.  Region  M  (j=M) is  the  region  occupied  by  the  inner  cylindrical  core, 
0  <  pM  <  rM  .  The  permittivity,  conductivity,  and  permeability  of  each  region  are  denoted  by  £j,Oj, 
and  jij ,  respectively.  The  propagation  vector  of  the  incident  wave  makes  an  angle  60  with  the 
negative  z  axes  and  its  projection  on  the  x-y  plane  makes  an  angle  <j)n  with  the  negative  x  axis.  The 
cylinders  are  infinitely  long  with  circular  cross  sections  and  with  arbitrary  materials. 


3.  Numerical  solution 

A  computer  program  is  written  to  calculate  the  projected  back  scattering  and  extinction  cross  sections 
from  an  eccentric  multi-layered  cylinder.  The  accuracy  of  the  developed  computer  program  is  verified 
for  different  special  cases.  The  single  homogeneous  cylinder  calculations  are  obtained  from  our  multi, 
eccentric  cylinder  computer  program.  In  addition,  we  were  able  to  duplicate  the  numerical  calculations 
of  two  eccentric  cylinders  recently  presented  in  [2],  Moreover,  the  extinction  cross  section  is  identical 
to  the  scattering  cross  section  when  the  materials  of  the  cylinders  are  not  absorbing  as  expected. 


4.  Numerical  calculations 

Numerical  calculations  of  the  projected  back  scattering  and  extinction  cross  sections  as  a  function  of 
the  angle  of  incidence  <f>0  were  generated.  We  define  the  projected  back  scattering  cross  section  o  by 
O  =  £)sin60,  where  Q  is  the  actual  back  scattering  cross  section.  Figure  2  shows  the  projected  back 
scattering  and  extinction  cross  sections  for  a  three-layered  eccentric  cylinder  as  a  function  of  4>o.  The 
radius  of  each  layer  is  1.0X,  0.8X,  and  0.6A,,  respectively,  where  X  is  the  incident  wavelength.  The 
plane  wave  is  incident  normally  on  the  axis  of  the  cylinder  0o=7t/2.  The  eccentric  parameters  di=0  and 
d2=0.2X;  the  orientation  angles  Yi=  72=  0.  The  relative  permittivites  of  outer  and  mid  layer  are  2,  and  3, 
respectively.  The  solid,  dashed,  dotted  solid,  and  chain  dotted  lines  are  for  the  inner  layer’s  relative 
permittivity  of  1,  2,  4,  and  6.5,  respectively.  The  effect  of  the  material  of  the  inner  layer  on  the  cross 
sections  is  quiet  evidence  from  this  figure.  The  figure  shows  that  both  the  projected  back  scattering 
and  extinction  cross  sections  are  very  sensitive  to  the  change  of  the  material  of  the  inner  layer  of  the 
cylinder. 
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Figure  2.  The  projected  back  scattering  and  extinction  cross  sections  as  function  <f>0  for  a  three- 
layered  cylinder  at  normal  incidence.  The  radius  of  each  layer  is  1.0X,  O.SX,  and  0.6X, 
respectively.  di=  0,  d2=0.2?i,  yi=  0,  and  y2=  0,  relative  permittivites  of  outer  and  mid  layers  are  2, 
and  3,  respectively.  The  solid,  dashed,  dotted  solid,  and  chain  dotted  lines  are  for  relative 
permittivity  of  the  inner  layer  of  1, 2, 4,  and  6.5,  respectively. 


We  have  also  investigated  the  effect  of  the  tilt  angle  on  the  extinction  and  projected  cross  sections. 
Figure  3  shows  these  cross  sections  as  functions  of  <j>o.  The  parameters  of  this  figure  are  the  same  as 
those  of  figure  2  except  that  the  tilt  angle,  0O,  is  7t/4.  Comparing  the  previous  two  figures  shows  a 
profound  change  in  the  scattering  patterns.  However,  the  projected  back  scattering  cross  section  shows 
more  sensitivity  than  the  extinction  cross  section  to  this  change. 
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Figure  3.  The  projected  back  scattering  and  extinction  cross  sections  as  a  function  of  <t>o.  The 
parameters  of  this  figure  are  the  same  as  those  of  figure  2  except  that  the  tilt  angle,  60,  is  Jt/4. 


We  further  considered  the  effect  of  the  eccentricity  parameters  on  the  cross  sections  of  a  three-layered 
cylinder.  Figure  4  shows  the  projected  back  scattering  and  extinction  cross  sections  as  a  function  of  <|>o 
for  normal  incidence  with  di=yi=y2=0.  The  radius  of  each  layer  of  the  cylinder  is  1.0X,  0.8X,  0.6X  and 
the  corresponding  relative  permittivity  of  each  layer  is  4,  3,  and  2,  respectively.  The  solid,  dashed, 
dotted  solid,  and  chain  dotted  lines  are  for  d2=0.2,  0.15,  0.1,  and  0,  respectively.  The  projected  back 
scattering  cross  section  shows  a  drastic  response  to  the  change  in  the  eccentricity  parameter. 
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Figure  4.  The  projected  back  scattering  and  extinction  cross  sections  as  a  function  of  4>o  for  ©  for 
a  three-layered  cylinder  at  normal  incidence  with  di=Yi=y2=0.  The  radius  of  each  layer  of  the 
cylinder  is  1.0A.,  0.8X,  0.6X  and  the  corresponding  relative  permittivity  of  each  layer  is  4, 3,  and  2, 
respectively.  The  solid,  dashed,  dotted  solid,  and  chain  dotted  lines  are  for  d2=0.2,  0.15,  0.1,  and 
0,  respectively. 


Figure  5  shows  the  cross  sections  as  a  function  <t>o-  The  parameters  of  this  figure  are  the  same  as  those 
of  figure  4  except  the  tilt  angle  is  60°.  The  curves  of  the  cross  sections  display  smooth  behaviors;  this 
is  especially  true  for  the  extinction  cross  section. 
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Figure  5.  The  projected  back  scattering  and  extinction  cross  sections  as  a  function  of  <j>o.  The 
parameters  of  this  figure  are  the  same  as  those  of  figure  4  except  the  tilt  angle  is  60°. 
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Using  the  Partial  Element  Equivalent  Circuit  (PEEC) 
Simulation  Technique  for  EMC  Applications 

Bruce  Archambeault 
IBM 


Introduction 

The  Partial  element  Equivalent  Circuit  (PEEC)  simulation  technique  was  first  introduced 
25  years  ago  [1].  It  has  mostly  been  used  for  quasi-static  partial  inductance  calculations 
until  recently.  New  developments  [2]  have  made  PEEC  a  full-wave  tool  with  time 
domain  capabilities. 

Modeling  and  simulation  is  not  new  to  EMC  applications.  There  have  been  many  articles 
m  the  literature  using  the  method  of  Moments  (MoM),  the  Finite-Difference  Time- 
Domam  (FDTD)  technique,  the  Transmission  Line  Method  (TLM)  or  the  Finite  Element 
method  (FEM).  Each  of  these  different  techniques  have  areas  where  their  individual 
strengths  make  them  the  best  choice  for  a  particular  modeling  problem,  and  each  has 
areas  where  they  are  not  appropriate.  Therefore,  a  tool-box  strategy  is  necessary,  where 
the  engineer  has  more  than  one  modeling  technique  available,  allowing  the  use  of  the 
proper  technique  for  the  given  task. 

PEEC  has  recently  become  a  popular  alternative  to  some  of  the  more  traditional  modeling 
techniques.  One  of  it’s  biggest  advantages  is  the  easy  ability  to  add  circuit  elements  into 
an  electromagnetic  simulator,  making  it  a  preferred  choice  for  printed  circuit  board 
simulations  which  include  lumped  circuit  elements,  such  as  inductance,  capacitance,  and 
resistance.  A  number  of  examples  of  PEEC’s  usefulness  in  EMC  modeling  applications 
will  be  presented  here. 

Brief  Description  of  PEEC 

There  are  a  number  of  technical  articles  describing  the  PEEC  technique  in  detail,  and 
those  will  not  be  repeated  here.  However,  a  very  brief  description  will  be  presented. 

PEEC  is  based  on  the  integral  equation  formulation.  All  the  structures  to  be  modeled  are 
divided  into  electrically  small  elements.  The  coupling  between  each  element  is  described 
as  an  equivalent  circuit  shown  in  Figure  1 .  Once  the  matrix  of  equivalent  circuits  are 
developed,  then  a  SPICE-like  circuit  solver  is  used  to  for  the  response  of  the  system  to 
the  simulation  applied  to  it. 


Since  the  solution  is  a  circuit  based  solution,  then  individual  circuit  elements,  such  as 
resistance,  capacitance,  and  inductance  can  be  easily  added  to  any  set  of  elements  or 
nodes. 
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EMC  Applications 

There  are  a  variety  of  EMC  applications  for  the  PEEC  technique.  Since  individual  circuit 
elements  can  be  added  to  the  model  easily,  simulations  involving  printed  circuit  (PC) 
boards  are  straightforward  to  create. 

Crosstalk  Simulations 

One  of  the  most  serious  concerns  of  an  EMC  analysis  of  a  PC  board  is  the  crosstalk 
between  a  critical  net  (such  as  a  clock  net)  and  an  I/O  net  which  could  potentially  provide 
a  convenient  path  for  the  high  frequency  harmonics  of  the  clock  signal  to  escape  the 
shielded  enclosure.  Figure  2  shows  a  diagram  of  an  example. 


External  Cable 


^  PC  Board 

Figure  2  Crosstalk  Example 


The  question  about  how  much  signal  reduction  is  provided  by  additional  distance 
between  parallel  traces  is  often  debated.  Also,  the  amount  of  benefit  obtained  by  using  a 
ground-guard  trace  on  either  side  of  a  critical  net  vs.  separation  distance  (without  the 
ground-guard  trace)  is  not  clear. 
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A  simple  model  was  created  (Figure  3)  to  illustrate  how  the  undesired  current  coupled 
into  the  I/O  net  would  be  affected  by  increasing  the  separation  between  nets,  or  simply 
placing  a  ground-guard  trace  between  the  critical  and  I/O  net. 


Figure  3  Crosstalk  Model  Configuration 

Figure  4  shows  the  result  of  the  PEEC  model  as  the  I/O  net  was  moved  further  from  the 
driving  net.  The  PEEC  time  domain  result  has  been  transformed  into  the  frequency 
domain  using  a  FFT,  and  then  normalized  to  the  amount  of  crosstalk  current  present  in 
the  I/O  trace  when  the  I/O  trace  was  as  close  as  possible  to  the  driving  net.  The  change  in 
the  amount  of  crosstalk  coupling  is  apparent  as  the  separation  between  traces  is  increased. 
When  the  ground-guard  trace  is  used,  the  crosstalk  coupling  is  less  than  with  the 
maximum  separation. 


Crosstalk  Current  Additional  Loss  Relative  to  Adjacent  Trace 


Figure  4  Crosstalk  Current  vs  Separation 
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Cascade  Crosstalk  Simulations  (Multi-level  Crosstalk) 

In  addition  to  ‘normal’  crosstalk  analysis  as  shown  in  the  previous  section,  cascaded 
crosstalk,  or  multilevel  crosstalk  is  also  a  consideration  to  EMC  engineers.  Figure  5 
shows  a  diagram  of  a  cascaded  crosstalk  configuration.  As  can  be  seen,  the  clock  net  is 
not  close  to  the  I/O  net,  but  another  net  becomes  a  fortuitous  conductor  between  them. 
This  can  allow  some  amount  of  clock-related  signal  voltage  at  the  I/O  connector,  which 
then  becomes  a  common-mode  voltage  (relative  tot  he  outside  of  the  shielded  enclosure), 
and  a  potential  EMI  emissions  problem. 

Figure  6  shows  the  results  of  an  example  using  PEEC  to  analyze  the  amount  of  unwanted 
signal  level  at  the  I/O  connector  due  to  cascaded  crosstalk  as  the  amount  of  parallel  traces 
between  the  clock  trace  and  fortuitous  conductor  is  varied.  The  time  domain  waveforms 
from  the  PEEC  analysis  were  converted  to  the  frequency  domain  for  convenient  analysis. 


Example  of  Cascade  Crosstalk  at  I/O  Connector 
for  Various  Parallel  Lengths 


Figure  6  Cascade  Crosstalk  Output  Example 


Zig-Zag  Trace  Analysis 

Another  EMC  application  of  PEEC  modeling  is  given  in  the  example  of  the  analysis  of 
zig-zag  traces.  In  order  to  ensure  clock  signals  arrive  at  all  destination  at  the  same  instant 
in  time,  traces  are  often  put  through  a  zig-zag  pattern  to  increase  the  trace  length  to  match 
the  longest  clock  trace.  Once  the  traces  are  all  the  same  length,  then  the  clock  signal  will 
arrive  at  all  destinations  simultaneously. 

There  has  been  a  considerable  amount  of  debate  concerning  whether  the  length  of  the  zig¬ 
zag  makes  a  difference  to  EMC  emissions  from  the  traces,  especially  on  four  layer  PC 
boards  where  the  zig-zag  pattern  is  exposed.  Figure  7  shows  examples  of  different  zig¬ 
zag  patterns  which  are  commonly  used.  For  this  example,  both  zig-zag  patterns  have  the 
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same  overall  trace  length.  In  real-world  designs,  this  pattern  can  be  anything  where  the 
required  length  is  achieved. 


Zig-zag  pattern  #1 


Zig-zag  pattern  #2 


Figure  7  Zig-Zag  Patterns 


In  addition  to  the  EMC  emissions  concerns  about  the  zig-zag  pattern,  there  is  also  a 
signal  integrity  concern.  PEEC  was  used  to  analyze  the  time  domain  waveshape  on  a 
number  of  different  zig-zag  patterns.  In  the  first  example,  the  trace  is  assumed  to  be 
perfectly  terminated  in  its  characteristic  impedance.  The  difference  in  the  waveshape 
arriving  at  the  receiver  is  shown  in  Figure  8.  One  of  the  advantages  of  the  PEEC 
technique  is  the  easy  ability  to  add  or  change  circuit  elements  within  an  electromagnetic 
model.  In  the  above  example,  the  termination  was  changed  from  a  resistor  to  a  10  pF 
capacitor  (representative  of  the  termination  impedance  of  CMOS  devices),  and  the  net 
was  terminated  at  the  source  with  a  series  impedance. 

Since  the  receive  end  of  the  net  is  not  properly  terminated,  a  reflection  is  expected.  The 
source  series  termination  is  designed  to  match  the  net  characteristic  impedance  as  to 
inhibit  any  secondary  reflection  from  the  source  end.  Figure  9  shows  the  near-end  time 
domain  waveshape  for  the  two  zig-zag  patterns.  Clearly  there  is  a  significant  difference 
between  the  cross-talk  coupling  effects  between  the  two  zig-zag  patterns.  This  distortion 
of  the  initial  pulse  contributes  additional  (and  unwanted)  harmonic  energy  to  the  original 
signal.  This  additional  harmonic  energy  will  in  turn  contribute  to  additional  direct 
emissions,  as  well  as  additional  common  mode  emissions. 

Summary 

The  need  for  a  variety  of  modeling  techniques  for  EMC  applications  has  been  well 
documented.  Each  of  the  various  techniques  has  a  different  strength  that  the  EMC 
engineer  can  use.  PEEC  has  not  been  used  for  EMC  applications  until  recently.  One  of 
the  strong  advantages  of  using  PEEC  is  the  convenience  of  including  circuit  based 
elements.  A  number  of  different  examples  were  PEEC  was  used  for  EMC  analysis  were 
presented. 
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Source-End  Time  Domain  Result  from  Zig-zag  Pattern  with 
10  pF  Capacitor  Load 


01  23456789  10 

Time  (ns) 


Figure  8  Zig-zag  Pattern  time  Domain  Result  for  Matched  Load 
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Figure  9  Zig-zag  Pattern  time  Domain  Result  for  Unmatched  Load 


Load-End  Time  Domain  Result  from  Zig-zag  Pattern  with 
Matched  Load 


Analytic  Validation  of  a  Low-Frequency  Magnetic  Induction  Code  with 

Nonuniform  Sources 


Trevor  W.  Dawson 

Dept.  Elec.  &  Comp.  Eng.,  University  of  Victoria 
PO  Box  3055  Stn  CSC  Victoria  BC  Canada  V8W  3P6 


Abstract 

This  paper  considers  the  problem  of  extremely  low-frequency  induction  in  a  homogeneous  sphere  by  an  exter¬ 
nal  line-current  source.  The  problem  has  an  analytic  closed-form  solution  that  exhibits  a  fully  three-dimensional 
electric  field.  The  solution  is  particularly  useful  for  testing  more  general-purpose  numerical  modeling  codes. 
Earlier  analytic  solutions  have  provided  tests  for  induction  in  homogeneous  bodies  by  homogeneous  magnetic 
fields,  as  well  as  induction  in  inhomogeneous  spheres,  also  by  uniform  fields.  These  models  exercise  the  ba¬ 
sic  implementation  of  the  numerical  implementation,  as  well  as  the  handling  of  conductivity  gradients.  This 
new  model  provides  an  additional  test  of  the  source  implementation.  It  has  a  particularly  useful  application 
in  the  validation  of  computer  models  for  induction  in  humans  by  power  transmission  lines.  In  this  paper, 
results  computed  using  an  established  scalar-potential  finite-difference  code  are  compared  with  the  analytic 
values.  Excellent  agreement  is  obtained,  apart  from  surface  field  discrepancies  associated  with  the  staircasing 
approximation  used  for  the  sphere. 


1  Introduction 

When  the  source  frequencies  are  sufficiently  low,  the  induction  in  compact  isolated  conductors  can  be  considered 
in  terms  of  separate  contributions  due  to  electric  and  magnetic  sources  [Olsen,  1994].  The  electric  field  inside  the 
conductor  may  be  represented  in  terms  of  an  unknown  scalar  potential  using  Stevenson’s  method  [  Van  Bladel, 
1985] .  The  applied  magnetic  field  manifests  itself  through  a  vector  potential  serving  as  an  electric  field  source.  The 
resulting  mathematical  equations  can  give  rise  to  a  variety  of  numerical  schemes,  including  the  Scalar  Potential 
Finite  Difference  (SPFD)  method  [Dawson  et  al.,  1997].  It  is  desirable  to  validate  any  numerical  solution,  preferably 
by  comparison  with  an  analytical  solution.  Suitable  analytic  solutions  are  relatively  rare,  particularly  for  fully  three- 
dimensional  problems.  The  solution  for  uniform  magnetic  induction  in  a  homogeneous  sphere  is  elementary,  but 
of  restricted  utility.  There  are  recently  published  solutions  for  an  uniform  magnetic  induction  in  an  equatorially 
stratified  sphere  [Dawson  and  Stuchly,  1997;  Dawson,  1997].  These  serve  to  validate  the  handling  of  conductivity 
gradients  in  a  numerical  code,  and  were  recently  applied  to  a  scalar  potential  finite  difference  code  [Dawson  and 
Stuchly,  1996].  This  code  is  being  extensively  applied  to  the  evaluation  of  60-Hz  induction  in  the  human  body 
[Dawson  and  Stuchly,  1998;  Dawson  et  al.,  1997]  by  uniform  magnetic  fields.  However,  uniform  magnetic  excitation 
is  not  a  suitable  source  model  for  occupational  exposures  faced  by  utility  workers.  Here,  linear  current  elements, 
both  finite  and  infinite  in  extent,  axe  more  suitable.  It  is  desirable  to  ensure  that  such  a  source  implementation  in 
a  numerical  code  is  correct  by  comparison  with  an  additional  method,  preferably  analytical. 

This  paper  presents  such  a  validation  for  induction  in  a  homogeneous  conducting  sphere  by  a  line  current 
source.  The  problem  has  a  simple  closed-form  solution.  The  result  is  compared  with  data  computed  using  the 
scalar  potential  finite  difference  code.  It  is  shown  that  good  agreement  is  obtained,  thereby  providing  the  requisite 
validation. 
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2  Model  and  Solution 

Coordinates  are  defined  with  respect  to  a  Cartesian  coordinate  system  {x,y,  z).  Coincident  spherical  ^ 

z>  5iU  .iso  be  freely  ueed.  A  notation  such  as  i  defrnes  a  umt  voctor  a^cmtod  wfrfr 
^corresponding  cooXIe.  The  problem  consists  of  an  isolated  homogeneous  sphere  of  rad,^  a  >  °  “d  c°^ 
ductivity  o0  >  0,  centred  at  the  coordinate  origin.  All  fields  are  assumed  time-harmonic  with  period  T  frequency 
f  Id  Ti  frequency  w  =  2 nf.  A  common  time  factor  «+**  is  therefore  assumed  and  suppressed  throughout. 
Field  lectors  denotedherein  as  uppercase  bold  quantities,  therefore  represent  complex  phasor  amphtudes^  A 
qlistott  approximation  is  assumed,  so  that  the  sphere  is  small  compared  both  to  the  source  wavelength  and  to 

the  internal  skin  depth. 

The  problem  it  formulated  usinf  Stevenson’s  method  [Ton  Biadel,  1985],  whidi  hb»d»» 
r  t Lap  f;Gi/4s  in  a  Dower  series  ordered  by  source  wavenumber  fco  —  u/c o»  or  equivalent  y,  y  <2  y 
denotes  the  speed  of  light  in  the  vacuum  surrounding  the  sphere.  Thus  the  zeroth-order  fields  represen  e  s  a  ic 
limit.,  first-order  fields  vanish  linearly  with  frequency  approaching  zero,  etc.. 

The  sphere  is  under  the  influence  of  an  applied  line  current  density  source,  whose  zeroth-order  form  near 
the  plane  a  =  0,  and  in  the  neighborhood  of  the  sphere,  may  be  assumed  to  have  the  (static)  form 

J%{x)  =  ioS(x  -b)S(y)z,  t1) 

where  to  denotes  the  peak  instantaneous  current.  Thus  the  line  source  lies  parallel  to  the  z-axis  and  intercepts  the 
plane  z  =  0  at  the  point  *  =  b,  y  =  0,  with  b  >  a.  This  last  inequality  ensures  that  the  source  is  external  to  the 

sphere.  The  dimensionless  parameter 

0  <  K  =  a/b  <1  w 

is  used  freely  throughout  the  derivation. 

This  work  considers  only  magnetic  excitation,  hence  the  zeroth-order  external  electric  field  is  zero  Fur¬ 
thermore  it  follows  from  the  general  results  of  Stevenson’s  method  that  the  static  limit  of  the  electric  field  is  zero 
within  the  conducting  sphere.  The  zeroth-order  electric  field  is  therefore  zero  everywhere, 

E0(x)  =  0.  (3) 

Similarly,  the  zeroth-order  magnetic  field  is  unperturbed  by  the  presence  of  the  conducting  sphere,  and  satisfies 
the  classical  magnetostatic  problem 

V  x  H0(x)  =  Je0(x) ,  ’  V.Ho(x)  =  0,  (4) 

in  the  entire  domain.  The  simple  solution  has  the  Cartesian  representation 

„  M  _  ( io ^  [(g-frJft-ygl  (5) 

This  can  be  expressed  in  terms  of  a  vector  potential  [Stratton,  1941) 

H0(x)  =  V  x  A0(*) ,  where  V-A0(a:)  =  0,  (6) 

and  it  is  readily  verified  that 

\  ,  l(x-b)2  +  y 2  ^  (p^n  cosnip 

A0{x)=[-f)Lb(x,y)z,  where  Lb(x,y)  =  - lnW - p  X,  U  n  ' 


is  an  appropriate  solution.  The  latter  form  for  Lb  (*,  y)  follows  from  a  Taylor  expansion  of  the  function  In  (1  -  <) 
in  the  disk  (|<|  <  1). 

The  first-order  internal  electric  field  satisfies  the  differential  equations 

V  x  E\(x)  =  -juJHoHo{x)  and  V-  [EJ(*)]  =0,  (r  <  a) .  (8) 


144 


subject  to  a  boundary  condition  of  tangential  surface  current  flow, 

f-£t1(iE)=0,  (r-fl).  (9) 

The  second  member  of  (8)  arises  from  conservation  of  internal  first-order  current  density,  V*  [Jj(®)]  =  0,  in  the 
presence  of  a  uniform  conductivity.  The  internal  electric  field  may  be  written  in  the  form 

E\(x)  =  -j  (^)  £'(*) ,  where  £\x)  =  zLb  (x,y)  -  bVip\(x) ,  (10) 

involving  a  dimensionless  electric  field  E\(x)  and  scalar  potential  ^j(x).  The  second  member  of  (8)  together  with 
the  boundary  condition  (9)  shows  the  scalar  potential  to  satisfy  the  boundary  value  problem 

VVi(*)=0,  (r<o),  =  cos0£6(s,y),  (r  =  a) ,  (11) 

within  the  sphere.  The  boundary  value  problem  can  be  solved  using  a  simple  two-dimensional  method.  Specifically, 
if  a  form 

V’i(s)  =  (z/b)Qm  (p,<p)  (12) 

is  assumed,  then  the  latter  function  satisfies  the  two-dimensional  boundary  value  problem 

V2Qm  (p,  <p)  =  0,  {1  4-  pdf,}  Qm  {p,  <p)  =  Lb  (x,  y) ,  {r  =  a}.  (13) 

The  solution  is  readily  obtained  as  follows.  Separated  solutions  of  (13a)  regular  at  p  =  0  and  periodic  are  necessarily 
members  of  the  set  {pm  cos(my>),  pm  sm(m<p)  j  m  =  0, 1, . . .}.  It  is  then  straightforward  to  use  the  expansion  (7b) 
and  the  boundary  condition  (13b)  to  verify  that 


and  hence  that 


Qm  ( P,<P ) 


^  fp\n  cos  nip 

^Kb)  JtfT+ij’ 


(r  <  a) , 


x+jy 

b 


(14) 


(15) 


is  a  solution  for  the  scalar  potential.  The  series  and  closed-form  expressions  are  both  useful.  Expressions  for  the 
electric  field  follow  from  an  application  of  (10)  in  the  appropriate  coordinate  system.  In  particular,  the  spherical 
coordinate  radial  component  of  the  internal  electric  field  is  identically  zero  within  the  sphere. 

The  series  solution  leads  to  the  analytic  value 

K  («2)  =  y  J[[v  (£'£)  dv  =  (2^3)  {I K  +  T  k3  ~  arcsin(«)  [*  axcsin(K)  -  ^  (2  +  «2)  y/l  -  «2J  |  (16) 

for  the  mean-square  dimensionless  internal  first-order  electric  field  over  the  sphere.  As  indicated  in  the  notation, 
the  right-hand  side  depends  only  on  the  square  of  the  source-sphere  separation  parameter  n.  This  quantity  is 
required  for  computation  of  the  physical  values  of  the  peak  instantaneous  volumetric  mean-square  internal  electric 
field,  mean-square  internal  current  density,  and  average  power  deposition  in  the  sphere. 


3  Results 


The  particular  model  under  consideration  consist  of  a  sphere  of  radius  a  =  0.5  m.  The  line  source  is  located  at 
6  =  1  m,  for  a  separation  of  0.5  between  the  line  and  sphere  surface.  The  dimensionless  ratio  is  therefore  «  =  0.5. 
The  line  source  has  frequency  /  =  60  Hz  and  carries  a  peak  current  70  =  1  A.  The  sphere  conductivity  does  not 
enter  into  the  electric  field  computation.  For  the  homogeneous  sphere  under  consideration,  the  current  density  is 
a  simple  multiple  of  the  electric  field. 
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Surface  Electric  Field 


s 


M 


Figure  1:  Depiction  of  the  electric  field  (and  hence  current  flow)  on  the  surface  of  the  sphere.  Angular  coordinates 
are  in  degrees. 


The  numerical  calculations  are  based  on  a  staircased  sphere  having  cubic  voxels  with  1  cm  edges,  so 
that  the  sphere  diameter  is  100  voxel-widths.  The  SPFD  code  uses  a  conjugate  gradient  solver  applied  to  a  sparse 
symmetric  matrix.  The  solution  leads  to  electric  field  components  defined  on  the  staggered  grid  formed  by  the  voxel 
edges  A  four-point  average  is  applied  to  each  of  the  three  sets  of  four  parallel  edge  field  components  associated 
with  each  conducting  voxel.  This  results  in  an  electric  field  vector,  defined  in  terms  of  its  Cartesian  components,  at 
every  point  on  the  grid  formed  by  the  conducting  voxel  centers.  For  comparison  purposes,  a  second  set  of  Cartesian 
field  vectors  is  computed  on  the  identical  three-dimensional  array  of  points  from  the  analytic  solution.  This  allows 
a  voxel-by-voxel  comparison  between  the  numerical  and  analytic  data  sets. 

It  may  be  recalled  that  the  current  pattern  induced  by  a  uniform  field  in  a  uniform  sphere  has  circular 
current  paths  normal  to  the  magnetic  source  vector,  with  field  magnitude  varying  only  in  the  transverse  direction. 
The  current  pattern  induced  by  the  present  line  source  is  similar,  being  dominated  by  a  vortex  system  whose  axis 
is  curved  in  the  plane  z  =  0.  The  field  pattern  is  given  in  Figure  1,  which  shows  the  surface  current  flow  pattern 
for  the  test  model  using  a  pseudo-Cartesian  representation.  Angular  coordinates  are  in  degrees.  The  vector  length 
is  proportional  to  the  local  field  magnitude  (in  arbitrary  units)  so  that  the  higher  field  values  near  <p  =  0  are 
clearly  evident.  The  lighter  lines  in  the  figure  lie  at  9  =  tt/2  (corresponding  to  the  equator  of  the  sphere)  and  at 
tp  =  -j-tt/2  Under  induction  by  a  uniform  source  directed  along  the  p-axis,  the  core  of  the  induced  current  vortex 
would  intercept  the  surface  of  the  sphere  at  the  intersections  of  these  lines.  For  the  line  source,  the  vortex-surface 
intercepts  move  closer  to  tp  —  0. 

To  provide  a  preliminary  demonstration  of  the  agreement  between  the  two  computations,  Table  I  presents 
three  values  of  the  volumetric  root-mean-square  of  the  induced  electric  field.  Units  are  in  microvolts  per  meter. 
The  analytic  value  (first  row)  is  computed  from  the  exact  solution  (16).  The  numerical  value  in  the  second  row  is 
obtained  using  a  discrete  summation  over  the  appropriate  voxel-center  electric  fields  magnitudes.  The  final  entry 
in  the  table  is  computed  using  the  same  discrete  summation  method,  but  applied  to  the  samples  of  the  analytic 
solution.  The  difference  between  the  values  in  the  first  two  lines  of  the  table  is  due  to  spatial  mismatch  between 


146 


Table  I:  Comparison  of  root  mean  square  electric  field  within  the  sphere. 
Analytic  value  12.2074 
Numerical  value  12.2314 
Discrete  analytic  12.2280 


Figure  2:  Comparison  of  the  analytic  (top  row) ,  numerical  (middle  row)  and  difference  (bottom  row)  electric  field 
magnitudes  in  the  Cartesian  planes  z  =  0  (left  column),  y  =  0  (middle  column)  and  x  =  0  (right  column).  Units 
on  the  shading  scale  are  microvolts  per  metre. 


the  analytic  and  voxel-based  spheres,  and  the  associated  differences  in  surface  current  flow  paths  between  the  true 
and  staircased  spheres.  That  the  discrepancy  is  due,  in  part,  to  the  geometric  differences  between  the  models  is 
further  supported  by  the  closer  agreement  between  the  last  two  rows  of  the  table,  where  the  differences  arise  in 
the  third  decimal  place. 

Figure  2  provides  an  additional  visual  indication  of  the  induced  field  pattern,  as  well  as  a  comparison 
between  field  values  computed  using  the  analytical  and  numerical  solutions.  The  figure  shows  a  shaded  represen¬ 
tation  of  the  field  magnitudes  in  the  three  principal  Cartesian  coordinate  planes.  The  top  and  middle  rows  pertain 
to  the  analytic  and  numerical  data  sets,  respectively.  Field  magnitudes  are  indicated  in  the  associated  color  scales, 
with  units  being  microvolts  per  meter.  From  left  to  right,  the  three  columns  pertain  to  the  planes  z  —  0,  y  =  0 
and  z  —  0,  respectively.  The  line  source  is  perpendicular  to  the  plane  of  the  first  column,  and  is  is  one  radius  to 
the  right  of  the  sphere  surface.  The  curvature  of  the  current  vortex  system  axis  in  the  plane  z  =  0  is  evident  in 
the  first  column.  The  bottom  row  plots  the  voxel-wise  difference  fields.  It  is  immediately  evident  that  significant 
discrepancies  are  restricted  to  the  sphere  surface,  where  the  staircasing  errors  are  most  pronounced. 

Finally,  Table  II  provides  a  numerical  comparison  between  the  two  electric  field  solutions,  in  terms  of  the 
Cartesian  components,  as  well  as  the  field  magnitude.  The  data  comprise  5  spatial  measures,  including  2  local 
(minimum  and  maximum)  and  3  global  (average,  standard  deviation  and  average  absolute  value)  quantities.  The 
latter  is  only  of  interest  when  applied  to  the  Cartesian  components.  Units  are  in  microvolts  per  meter.  These 
data  are  shown  for  the  analytic  (upper  section)  and  numerical  (second  section)  solutions,  as  well  as  for  the  voxel- 
wise  difference  fields  (third  section).  The  bottom  row  of  the  table  indicates  the  full  three-dimensional  correlation 
coefficients  between  the  two  fields,  taken  over  all  conducting  voxels.  The  table  shows  that  pointwise  discrepancies 
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Table  II:  Comparison  of  analytical  and  numerical  computations. 


Ex 

Ey 

Ez 

\E\ 

Min. 

-20.003 

-3.366 

-28.663 

0.000 

Max. 

20.003 

3.366 

14.189 

28.909 

Analytic 

Avg. 

0.000 

0.000 

0.000 

5.530 

Std.  Dev. 

5.975 

0.772 

6.130 

6.580 

Avg.  Abs. 

3.491 

0.404 

3.516 

5.530 

Min. 

-25.058 

-5.048 

-35.243 

0.000 

Max. 

25.058 

5.048 

17.632 

35.947 

Numeric 

Avg. 

0.000 

0.000 

0.000 

5.520 

Std.  Dev. 

5.964 

0.777 

6.116 

6.566 

Avg.  Abs. 

3.486 

0.407 

3.509 

5.520 

Min. 

-8.325 

-3.641 

-9.491 

-7.856 

Max. 

8.325 

3.641 

7.482 

7.520 

Ana  -  Num 

Avg. 

0.000 

0.000 

0.000 

0.010 

Std.  Dev. 

0.209 

0.112 

0.218 

0.233 

Avg.  Abs. 

0.034 

0.020 

0.035 

0.038 

Correlation 

0.9994 

0.9896 

0.9994 

0.9996 

can  be  of  the  order  25%.  However,  as  indicated  in  Figure  2,  these  are  confined  to  the  sphere  surface.  The  global 
measures  in  the  table  confirm  the  good  overall  agreement  between  the  two  computations,  and  hence  the  correctness 
of  the  line  current  source  implementation  in  the  numerical  code. 

4  Closing  Remarks 

This  paper  has  presented  a  comparison  between  numerical  and  analytical  solutions  to  the  problem  of  low-frequency 
induction  in  a  uniform  conducting  sphere  by  a  line  current  magnetic  source.  This  problem  is  of  relevance  in  the 
verification  of  computer  models  for  the  evaluation  of  low  frequency  induction  in  humans  by  power  transmission 
lines.  It  is  shown  that  the  numerically  computed  results  are  in  excellent  agreement  with  the  analytic  values,  apart 
from  surface  discrepancies  related  to  the  staircasing  approximation  used  for  the  sphere. 
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Abstract—  In  this  study,  a  dual  coordinate  coupling  method  is  proposed  for  implementation  of  finite  element 
and  cylindrical  PML  formulation.  This  method  takes  full  advantages  of  the  finite  element  method  and  the 
cylindrical  PML  method  avoiding  numerical  integration  normally  required  by  the  transformation  of  different 
coordinates.  In  this  method,  different  coordinate  systems  are  used  in  different  regions,  cylindrical  based 
sectorial  elements  in  PML  region,  and  Cartesian  coordinate  based  finite  elements  in  scatterer  and  buffer 
regions.  Compared  with  traditional  coordinate  transformation  method,  this  method  is  more  efficient  and 
accurate. 


i.  Introduction 


In  the  past  a  few  years,  Perfectly  Matched  Layer  (PML)  techniques  have  attracted  wide  interest  in  truncating 
infinite  space  into  finite  space  for  effective  solutions  of  electromagnetic  scattering  and  radiation  problems  [l]-[3]. 
Finite  Element  Method  (FEM)  has  proven  to  be  a  powerful  tool  in  treating  problems  with  complex  geometry  and 
material  profile.  The  combination  of  FEM  and  PML  takes  the  advantages  of  both  techniques  and  is  considered  as 
a  promising  approach  for  solving  radiation  and  scattering  problems  which  involve  objects  with  arbitrary  geometry 
and  material  parameters. 

An  ideal  PML  comprises  of  an  anisotropic  medium,  whose  complex  permittivity  and  permeability  tensors  e  and 
f  are  chosen  in  such  a  way  that  it  can  absorb  an  arbitrary  incident  eletromagnetic  wave  without  any  reflection. 
PMLs  were  originally  derived  for  Cartesian  coordinates  and  planar  interfaces,  in  which  the  plane  wave  of  arbitrary 
frequency  and  angle  is  incident  from  half-free  space  upon  the  PML  half-space.  Recently,  PMLs  have  been  extended 
to  cylindrical  and  spherical  coordinates  [4], [5]  in  order  to  solve  curvilinear  scattering  problems  more  effectively.  In 
these  circumstances,  a  general  curved  vacuum-layer  interface  is  more  suitable  than  a  planar  interface.  The  use  of 
cylindrical  PML  in  these  cases  can  reduce  computational  cost  and  increase  result  accuracy.  Although  the  cylindrical 
PML  has  its  advantage  in  solving  unbounded  problem,  it  encounters  some  implementation  difficulties  because  the 
cylindrical  PML  is  normally  expressed  in  cylindrical  coordinate,  while  the  scatterer  tends  to  be  expressed  in 
Cartesian  coordinate. 

In  this  paper,  a  dual-coordinate  coupling  element  method  is  proposed  to  solve  this  problem.  In  this  study,  different 
kinds  of  elements  are  used  in  different  regions.  The  sectorial  elements  are  used  to  subdivide  the  PML  region, 
and  triangular  or  quadrilateral  elements  axe  used  to  subdivide  the  region  enclosed  by  the  PML.  These  two  kinds 
of  elements  are  coupled  at  their  interfaces.  In  this  study,  the  performance  of  this  coupling  element  method  is 
compared  with  the  traditional  method  which  transfers  the  different  coordinates  into  one.  Numerical  results  show 
that  the  coupling  method  is  more  effective  than  the  traditional  method  in  the  implementation  of  cylindrical  PML 
with  Finite  Element  Method  (FEM). 
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II.  Implementation  of  Cylindrical  PML 


For  a  scatter  with  arbitrary  shape  and  inhomogeneous  material  profile,  we  can  use  a  hybrid  FEM/PML  approach. 
As  shown  in  Fig.  1,  the  cylindrical  PML  is  convenient  for  cylinder-like  scatters.  Using  cylindrical  PML  can  minimize 
the  space  between  the  PML  layer  and  the  scatterer,  therefore  reduce  finite  element  region  and  the  computational 
effort. 


Finite  element  meshes  are  normally  expressed  in  in  Cartesian  coordinate  while  the  cylindrical  PMLs  are  best 
expressed  in  cylindrical  coordinate.  In  the  cylindrical  PML,  the  material  properties  of  the  cylindrical  PML  is 
expressed  as: 


JL 

IM> 


e 

eo 


s^Sz/sp  0  0 

0  SpSz/s^,  0 

0  0  sps^/s2. 


(1) 


where  sp  =  1  -  j/3p,  sz  =  l  —  jfiz  and  s#  =  (p0  4-  JJq  spdp)lp. 

The  traditional  method  transfers  the  cylindrical  PML  into  Cartesian  coordinate,  so  that  the  same  coordinate  system 
is  used  for  both  the  FE  and  PML  segmentations.  The  transformation  from  cylindrical  coordinate  to  Cartesian 
coordinate  is  shown  as  follows. 

Suppose  the  complex  permittivity  and  permeablity  tensors  of  PML  medium  are  f  =  e[A]  and  p  =  //[A],  where 
[A](Pi^2)=diag(<x,  b,  c).  It  is  necessary  to  calculate  the  representation  of  [A](2]Vi2)  with  respect  to  the  canonical  basis 
in  ( x,y,z )  system,  since  the  governing  partial  differential  equations  axe  given  in  terms  of  the  variables  (x,y,z). 

The  representation  of  \A\(x,y,z)  in  the  {x,y,  z)  system  is  given  by 

[A](x,y,z)  =  JT[A}M,z)J  (2) 


where  J  is  the  transformation  Jacobian. 
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The  e  and  fi  at  the  point  P(p,  (/>,  z)  are: 


I  =  JL 

eo  A*o 


a  cos2  <j>  +  b  sin2  <f>  (a  -  b)  cos  <j>  sin  <j>  0 

(a  —  b)  cos  4>sm<t>a  sin2  4>  +  b  cos2  4>  0 

0  0  c  . 


(3) 


In  cylindrical  PML,  the  parameters  a,  b  and  c  in  [A](Pi(M)  axe  different.  Hence,  the  PML  medium  is  inhomogeneous 
in  {x,y,z)  system,  and  numerical  integrations  are  needed.  Apparently,  this  method  increases  the  computation  cost 
and  error,  which  counteracts  the  advantage  of  introducing  and  applying  the  cylindrical  PML. 


in.  Coupling  Element  Method 

To  take  the  full  advantage  of  the  cylindrical  PML,  a  simple  and  effective  implementation  of  this  method  is  proposed 
in  this  study.  In  this  method,  both  of  the  Cartesian  and  cylindrical  coordinate  systems  are  used  simultaneously,  and 
the  transformation  is  avoided.  Finite  elements  based  on  the  Cartesian  coordinate  system  are  used  for  the  scatterer, 
and  the  cylindrical  sectorial  elements  based  on  cylindrical  coordinate  system  are  used  in  the  PML  region.  The  two 
types  of  elements  are  coupled  at  the  FE  and  PML  interface. 

According  to  the  FEM  theory,  a  basic  requirement  of  the  discretization  is  that  there  should  be  no  overlaps  and  gaps 
between  neighbor  elements.  Furthermore,  the  elements  should  be  connected  via  their  vertices,  or  in  other  words, 
a  vertex  of  one  element  is  also  an  vertex  of  the  neighboring  elements.  In  this  coupling  method,  the  connection  of 
the  elements  at  the  interface  is  shown  in  Fig.  2. 

Even  though  different  kinds  of  elements  are  used,  the  neighbor  elements  are  connected  via  their  vertices.  Although 
there  exist  small  gaps  between  sectorial  elements  and  finite  elements,  this  error  can  be  ignored  when  the  element 
meshing  is  fine  enough. 


The  coupling  between  the  two  formulations  of  these  two  kinds  of  elements  should  be  assured  by  using  suitable 
boundary  conditions.  In  finite  element  method,  the  continuity  of  the  tangential  component  of  the  field  must  be 
assured  across  the  boundaries  of  the  elements,  so,  it  must  be  satisfied  on  the  interfaces  between  the  two  different 
elements.  Fortunately,  at  the  interface  between  medium  1  and  medium  2,  we  have  the  following  boundary  condition 

n  x  (Ei  —  E2)  ~  0  ,  h  x  {Hi  -  H2)  =  0  (4) 

Since  the  two  different  formulations  are  in  different  media,  the  connection  of  these  two  elements  must  be  at  the 
interface.  According  to  (4),  the  tangential  continuous  condition  of  two  element  is  satisfied  between  two  kinds  of 
elements,  no  matter  whether  the  problem  is  TM  or  TE  problem. 
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Taking  2-dimensional  problem  as  example,  in  the  region  expressed  by  Cartesian  coordinate  system,  triangular  or 
quadrilateral  elements  are  used.  In  the  cylindrical  PML  region,  as  shown  in  Fig.  3,  cylindrical  sector  elements  are 
chosen. 


3 


Fig.  3.  sector  element 


The  interpolation  Nj(p,<j>)  for  a  sector  element  are: 

Ni(p,  0)  =  (/>  ~  P2)(4>  -  02 )/A,  NUp,  0)  =  ~(p  -  p2)(0  -  01  )/A 

(p>  0 )  =  ~(p  -  Pi)(0  -  02)M.  NUp,  0)  =  (p  -  Pl)(0  -  0OM  (5) 

where  A  =  (pi~  p2)(0i  -  02) 

iv.  FEM-PML  Formulation 


To  solve  a  scattering  problem,  the  open-region  should  be  truncated  by  PML  which  is  in  turn  covered  by  perfectly 
electrical  conductor  (PEC),  as  shown  in  Fig.  1. 


In  region  fi3,  only  scattered  field  exists.  In  region  fix  and  fi2,  the  field  satisfies  the  double-curl  equation: 

Vx[(p-1(VxE)]-fcgeJ5  =  0  (6) 

where  B  =  +  E^nc\  2£(mc)  is  the  incident  field,  and  is  the  scattered  field. 


In  all  regions  (fl$,  i  =1,  2,  and  3),  the  scattered  field  satisfies  the  following  equations: 


V  x  [p-1  •  (V  X  £(“>)]  -  kpE^  =  F <inc> 

(7) 

p(inc)  _  V  x  [^-1(V  x  £(<"<0)]  _  ifc2ef(inc) 

in  fix  and  ft2, 

(8) 

p(mc)  _  q 

in  ft3 

0) 

n  x  E (“)  =  0 

on  52 

(10) 

where  S2  is  the  outer  PEC  boundary  of  PML. 

This  boundary  value  problem  can  be  formulated  by  Ritz  method: 

F(0ac))  =  J  [(p-1  ■  V  x  E(sc))  ■  (V  x  &sc))  -  Jfegfi5<sc>  •  #“>]  dV  -  2  J  •  F^dv  (11) 
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Equation  (11)  can  be  described  by  two  different  elements:  the  conventional  triangular  or  quadrilateral  elements  in 
region  1  and  2,  and  the  sectorial  elements  in  region  3.  In  regions  1  and  2,  the  interpolation  function  is  Nt,  while, 
in  region  3,  the  interpolation  function  is  Nc.  The  electrical  field  is: 


4=1 

if  the  element  is  in  region  1  or  2 

(12) 

4 

Ei  =  ^2Ei  Nci’ 

if  the  element  is  in  region  3 

(13) 

»=i 


where  E{  are  the  electrical  field  at  the  nodal  points  of  each  element. 
The  open  region  problem  can  be  solved  through  equation  (14) 


mm  =  [B] 

(14) 

where  in  regions  1  and  2: 

m  = 

[  j + 

Jvi o  p/  dy  n  dy  '  +  p*1  dx  n  dx  s 

(15) 

[. Be 1  =  !  {Ne}F^dv 

(16) 

JvM 

In  region  3 

* 

II 

(17) 

[B]=  (  {ATe}F<iT!CW 

JVM 

(18) 

v.  Implementation  and  Experiment 


In  this  section,  an  example  of  2D-TM  incidence  from  a  homogeneous  cylinder  of  circular  cross-section  is  studied  by 
using  cylindrical  PML.  The  parameters  of  the  PML  material  in  a  point  (p,  4>)  axe  assumed  as:  sp  =  l~j<rmaxd?/D2, 
and  =  1  —  jCmax  where  amax  denotes  the  maximal  loss  tangent,  D  the  thickness  of  the  PML,  and  d  the 
distance  between  the  point  (p,  4>)  and  the  air-PML  interface. 

In  this  example,  both  implementations  of  cylindrical  PML  are  compared.  In  the  coupling  element  method,  the 
triangular  elements  are  used  in  scatterer  and  buffer  region,  and  the  sectorial  elements  are  used  in  PML  region.  In 
the  coordinate  transformation  method,  only  triangular  elements  are  used,  and  one-point  and  three-point  Gaussian 
numerical  integrations  are  used,  respectively.  The  numerical  results  of  these  two  methods  are  shown  in  Fig.  4. 

From  Fig.  4,  we  can  see  clearly  that  the  coupling  element  method  achieves  more  accurate  result  than  that  of 
the  transformation  method.  The  accuracy  of  transformation  method  can  be  improved  if  the  integral  points  are 
increased.  But  the  increase  of  the  number  of  integral  points  will  also  increase  computational  cost.  The  computa¬ 
tional  time  that  the  two  methods  spend  in  forming  the  coefficient  matrix  is  compared  in  Table  I,  it  can  seen  that 
the  coupling  element  method  takes  much  less  time  than  the  transformation  method.  More  time  will  be  required  in 
the  transformation  method  if  the  number  of  integral  points  is  further  increased.  This  increase  of  the  computation 
time  may  be  formidable  for  3D  problems. 
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Fig.  4.  RCS  of  a  circular  dielectric  cylinder.  The  scatterer  is  a  homogeneous  cylinder,  its  radius  is  0.3A,  e  =  4,  p  =  1,  TM  plane  wave 
incident  in  —  x  direction 


TABLE  I 

Compare  the  coupling  element  and  triangle  element 


coupling  method 

1  point  integral 

3  point  integral 

matrix  A 

18.10  (sec) 

28.08  (sec) 

39.89  (sec) 

vi.  Conclusions 


In  this  study,  a  coupling  element  method  is  proposed  for  cylindrical  PML  and  FEM  implementation.  This  method 
uses  different  coordinate  systems  in  different  regions,  i.e.,  sectorial  elements  in  PML  region,  and  finite  elements 
in  scatterer  and  buffer  regions.  This  method  takes  full  advantage  of  the  cylindrical  PML  method  by  avoiding 
the  transformation  of  different  coordinates.  Numerical  experiments  show  that  this  technique  is  more  effective  and 
accurate  than  traditional  transformation  method. 
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Abstract 

Wavelet  transforms  are  often  used  to  compress  dense  matrices  arising  in  discretiz¬ 
ation  of  electromagnetic  integral  equations.  The  problem  which  kind  of  wavelets  are 
best  suited  for  this  application  is  not  yet  settled.  Recently  some  studies  showed  that 
the  nonorth ogonal  wavelet  transforms  produce  sparser  matrices  than  those  based  on 
orthonormal  wavelets.  This  paper  studies  the  effect  of  B-spline  biorthogonal  wave¬ 
let  transforms  (BWTs)  on  conditioning  and  compression  of  impedance  matrices.  We 
demonstrate  that  large  condition  numbers  produced  by  the  BWTs  cause  corresponding 
increases  in  number  of  iterations  needed  to  solve  the  sparsified  systems.  This  tends  to 
offset  any  gains  from  higher  sparsity. 


1  Introduction 

Discretization  of  electromagnetic  integral  equations  via  the  method  of  moments  produces 
dense  matrix  equations.  For  electrically  large  objects  the  systems  become  so  large  that  it¬ 
erative  solvers  must  be  used.  In  such  cases,  an  approximation  to  the  solution  is  obtained  in 
0(pN 2)  hops,  where  p  is  the  number  of  iterations  and  N  the  size  of  the  system,  in  contrast 
to  0(N3)  flops  used  by  the  direct  solver.  In  order  to  further  reduce  the  operation  count 
various  techniques  have  been  used  to  sparsify  the  system  matrices.  Among  such  approaches 
the  wavelet  transforms  continue  to  attract  considerable  interest  [1,  2,  3,  4].  Information  is 
still  needed  on  what  kind  of  wavelet  transforms  is  more  effective  to  solve  the  EM  problems. 
Recently  it  has  been  demonstrated  that  the  semi-orthogonal  wavelet  transforms,  construc¬ 
ted  by  Chui  and  Wang  [5],  produce  higher  matrix  sparsities  than  the  orthonormal  wavelet 
transforms  at  the  same  level  of  accuracy  [6,  7].  This  paper  studies  the  effect  of  another  B- 
spline  based  transform,  the  biorthogonal  wavelet  transforms  (BWTs),  proposed  by  Cohen, 

“This  research  was  supported  by  a  grant  from  The  Boeing  Company. 
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Daubechies,  and  Feauveau  [8],  on  conditioning  and  sparsification  of  impedance  matrices. 
Although  the  BWTs  have  the  attractive  property  of  finite  length  filters  for  both  primary 
and  dual  wavelets,  we  find  that  the  BWTs  lead  to  large  condition  numbers  increasing  the 
number  of  iterations  needed  to  solve  the  matrix  equations.  Thus  the  cost  of  iterative  solu¬ 
tion  of  systems  sparsified  by  the  BWTs  is  larger  than  the  corresponding  cost  for  orthonormal 
wavelet  transforms. 


2  Matrix  Sparsification 

Consider  the  matrix  equation  ZJ  —  E  obtained  by  the  method  of  moments  from  an  electro¬ 
magnetic  integral  equation.  Here  Z  represents  the  dense  (diagonalizable)  impedance  matrix 
of  size  N  x  N,  J  is  the  surface  current  vector,  and  E  is  the  excitation  vector.  To  introduce 
sparsifying  transformations  we  let  T  be  the  wavelet  matrix  and  denote  by  Tf  its  inverse. 
Consider  the  new  system 

(TZT^TJ)  =  (: TE ), 

which  can  be  written  as  Z'J'  =  E'.  Various  wavelet  transforms  T  produce  matrices  Z'  with 
many  small  elements  which  can  be  set  to  zero  without  adversely  affecting  the  quality  of 
the  solution  approximation.  Matrix  T  is  usually  implemented  by  repeated  application  of 
low-pass  and  high-pass  filters. 

Popular  candidates  for  such  filters  are  based  on  Daubechies’  compactly  supported  orthonor¬ 
mal  wavelets  [1,  2,  3]  and  B-spline  biorthogonal  wavelets  of  Cohen,  Daubechies,  and  Feauveau 
[6].  In  the  case  of  orthonormal  wavelets  T  =  T\  so  that  matrix  Z'  is  orthogonally  similar 
to  matrix  Z.  Thus  Z  and  Z'  have  the  same  eigenvalue  spectrum  and  if  X  represents  the 
matrix  of  N  eigenvectors  for  matrix  Z  then  TX  is  the  matrix  of  eigenvectors  for  matrix 
Z' .  This  is  important  since  the  convergence  of  iterative  solvers  deteriorates  with  increase 
of  condition  number  of  the  system  matrix  k{Z)  (for  the  conjugate  gradient  on  the  normal 
equations  (CGNE)  solver)  or  with  condition  number  of  the  matrix  of  eigenvectors  k(W)  (for 
the  generalized  minimal  residual  (GMRES)  solver)  [9].  If  T  is  orthogonal,  then  k(Z)  =  k(Z') 
and  k(X)  =  k(TX)  so  that  iterative  solvers  will  exhibit  the  same  convergence  history  for 
both  ZJ  —  E  and  Z'J'  =  E'  systems. 

The  situation  is  quite  different  in  the  case  of  biorthogonal  wavelets.  In  this  case  Z'  is  not 
similar  to  Z  which  implies  different  spectra  and  different  condition  numbers  for  matrices  of 
corresponding  eigenvectors.  The  next  sections  examine  the  dependence  of  these  indicators 
on  the  size  of  Z  and  the  wavelet  as  well  as  their  impact  on  convergence  properties  of  the 
CGNE  solver. 
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3  Compactly  supported  orthonormal  and  biorthogonal 
wavelet  transforms 

Detailed  descriptions  of  compactly  supported  orthonormal  wavelet  transforms  and  the  B- 
spline  biorthogonal  wavelet  transforms  can  be  found  in  [10]  and1  [8],  respectively.  This  section 
introduces  a  specific  notation  needed  to  explain  our  results.  We  define  the  /-level  wavelet 
transform  T  by  the  product  of  one  level  wavelet  transforms  Wk 

T  =  Wn_l+1---Wn.1Wn , 


~ 

'  H-J  ' 

1 

II 

\ 

.  Gn-j 

sT 

i 

s; 

i _ 

where  Hn„j,Gn_j  are  low-  and  high-pass  matrices  defined  by  their  filter  coefficients  {hk,gk}, 
which  are  highly  related.  Thus,  in  the  case  of  the  Daubechies  orthonormal  wavelets  (OWT) 
with  p  vanishing  moments,  the  high-pass  coefficients  gk  can  be  expressed  in  terms  of  low-pass 
coefficients  hk  as  gk  =  (-l)%p_fc+3,  k  =  1, . . . ,  2p+2.  In  the  case  of  biorthogonal  wavelets 
coefficients  gk  are  given  by  gk  —  (— l)*A2p_fc+3,  where  {hj}  denote  the  coefficients  of  the  dual 
scaling  function  and,  in  general,  p^P-  For  the  family  of  (m,  A )—  BWTs  based  on  B-splines 
of  order  m,  the  filter  coefficients  hk  and  hk  are  defined  by  the  identities  [8] 

2.  (1) 

where  parameter  K  satisfies  K  >  (m  +  l)/2.  It  follows  that  for  a  given  parameter  pair 
(m,  K)  there  are  m  +  2  and  AI\  -  m  —  2  nonzero  coefficients  hk  and  hk,  respectively.  The 
Haar  wavelet  is  obtained  by  setting  m  =  0,  K  =  1.  If,  for  example,  m  =  2  and  K  =  3,  then 

[A0, hl7...,h3]  =  [1,3,3,  l]V2/8, 

[h0, . . . ,  h7]  =  [3,  -9,  -7, 45, 45,  -7,  -9, 3]  V2/64. 

The  number  of  vanishing  moments  for  this  type  of  wavelets  is  2A  —  m  —  2.  It  is  worth  noting 
that  the  filter  coefficients  for  the  biorthogonal  wavelets  are  rational  numbers  (except  for  the 
common  factor  y/2).  Both  the  scaling  and  wavelet  functions  have  simple  expressions  and  thus 
are  well  suited  for  direct  method  of  moments  computations  as  basis  functions.  This  is  not 
the  case  for  the  compactly  supported  orthonormal  wavelets  (except  for  the  Haar  wavelet). 
However,  as  we  will  see  in  the  next  sections,  biorthogonal  wavelets  give  rise  to  impedance 
matrices  with  large  condition  numbers. 


4  Conditioning  of  B-spline  biorthogonal  wavelet  trans¬ 
forms 

Define  the  condition  number  of  a  given  matrix  T  by  k{T)  =  ||T,|[2|[T''1||2,  where  j|  -  |j2 
denotes  the  matrix  2-norm.  It  has  recently  been  shown  that  the  condition  numbers  of  one- 


158 


level  B-spline  BWTs  given  by  formula  (1)  increase  with  m  at  least  as  fast  2m  [11].  Table  1 
illustrates  the  growth  of  k(T)  as  a  function  of  m.  Note  that  for  fixed  m,  condition  number 
k(T)  quickly  stabilizes  at  2m  as  K  increases.  Since  the  number  of  vanishing  moments  for 
this  type  of  wavelet  is  p  =  2K  —  m  —  2  and  the  matrix  sparsity  is  related  to  p,  the  best 
strategy  to  minimize  k(T )  and  maximize  p  is  to  choose  small  m  and  large  K. 


Table  1:  Condition  numbers  k(T)  for  one-level  (m,  K) BWT 


K  \  m 

0  2  4  6  8 

1 

LOO 

3 

1.42  4.00  16.0 

5 

1.58  4.00  16.0  64.0  834. 

7 

1.68  4.00  16.0  64.0  259. 

9 

1.75  4.00  16.0  64.0  256. 

Table  1  show's  the  condition  numbers  of  just  one  level  of  wavelet  transformation.  Since,  in 
practice,  more  than  one  level  is  used  we  study  the  condition  number  of  a  sample  of  BWTs 
as  a  function  of  number  of  levels  l.  Table  2  shows  that  k(T )  increase  fairly  rapidly  with 
l  for  BWTs  with  m  >  2  and  so  the  asymptotic  bounds  must  turn  out  to  be  quite  large. 
Asterisks  denote  cases  where  no  transformation  was  possible  due  to  the  length  of  the  filter. 
In  addition,  for  each  fixed  level  /,  the  condition  numbers  are  larger  for  larger  m.  This  tends 
to  rule  out  the  use  of  BWTs  with  large  m.  Interestingly,  it  has  been  shown  recently  that 
k(T)  for  each  BWT  is  bounded  as  l  -»  oo  (and  thus  the  BWTs  are  stable  in  mathematical 
terms)  [8,  12]. 


Table  2:  Condition  numbers  k(T)  for  (m,  A^)BWT  with  l  levels 


1 

1 

2 

3 

4 

5 

(m,K) 

(0,2) 

1.28 

1.42 

1.52 

1.60 

1.65 

(0,3) 

1.42 

1.66 

1.82 

1.93 

2.00 

(0,4) 

1.51 

1.82 

2.02 

2.15 

2.24 

(0,5) 

1.58 

1.94 

2.16 

2.31 

* 

(2,2) 

4.00 

6.39 

10.0 

15.1 

22.6 

(2,3) 

4.00 

5.27 

6.28 

7.31 

7.87 

(4,5) 

16.0 

25.6 

40.3 

62.4 

96.1 

(6,7) 

64.0 

131. 

415. 

1221. 

* 

(8,9) 

!  256. 

876. 

5995. 

35980. 

* 

The  condition  numbers  of  system  matrices  are  related  by  k(Z')  <  k(Z)k2(T).  Thus  k(T) 
is  only  an  indirect  measure  of  conditioning  of  system  matrix  Z'.  To  provide  a  comparison, 
Table  3  lists  the  condition  numbers  of  Z'  for  a  system  obtained  from  the  point  matching 
discretization  of  the  combined  field  electric  equation  for  a  circular  cylinder  with  10  pulse 
functions  per  wavelength  and  N  =  256.  Note  that  the  column  with  1  =  0  corresponds  to 
k(Z)  (the  original  matrix).  The  results  show  that  condition  number  of  the  transformed 
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matrix  Z'  grows  rapidly  with  number  of  levels  l  (for  fixed  m  >  2  and  K)  and  with  spline 
order  m  (for  fixed  level  /).  In  addition,  the  comparison  of  Tables  2  and  3  shows  that  the 
bound  k(Z)k2(T)  is  quite  accurate. 


Table  3:  Condition  numbers  k(Z’)  for  (to,  7QBWT  with  l  levels 


1 

0 

1 

2 

3 

4 

5 

(m.K) 

(0,2) 

6.79 

7.35 

8.76 

S.62 

9.82 

9.83 

(0,3) 

6.79 

7.48 

9.81 

l.Oel 

1.2el 

1.2el 

(0,4) 

6.79 

7.52 

l.Oel 

l.lel 

1.4el 

1.4el 

(0,5) 

6.79 

7.53 

l.lel 

1.2el 

1.5el 

* 

(2,2) 

6.79 

1.6el 

4.7el 

1.5e2 

3.4e2 

7.5e2 

(2,3) 

6.79 

1.6el 

2.8el 

4.7e2 

5.6e2 

1.2e2 

(4,5) 

6.79 

2.5e2 

5.7e2 

1.5e3 

3.6e3 

7.7e2 

(6,7) 

6.79 

4.1e3 

l.le4 

4.4e4 

1.9e5 

* 

(8,9) 

6.79 

6.6e4 

7.5e5 

3.6e7 

1.3e9 

* 

To  illustrate  the  convergence  of  iterative  solvers,  the  original  and  B  W T-transformed  systems 
described  above  with  N  -  256  were  solved  by  the  CGNE  method  with  relative  tolerance 
tol=le-3.  There  was  no  preconditioning  and  the  iterations  started  with  the  zero  vector.  The 
column  with  l  =  0  shows  iterations  with  the  original  impedance  matrix  Z.  In  addition, 
we  performed  iterations  with  matrix  Z  transformed  with  Daubechies  orthonormal  wavelets 
with  vanishing  moments  p  =  1, ... ,7  and  levels  /  =  1, . . .  ,5.  All  such  runs  ended  with  the 
same  number  of  16  iterations.  Note  the  considerable  increase  of  iterations  for  the  BWT- 
transformed  systems  with  m  >  2  and  strong  correlation  between  the  size  of  k(T)  (see  Table 
2)  and  the  number  of  iterations.  The  growth  of  number  of  iterations  for  m  =  0  is  relatively 
mild.  From  these  results  we  conclude  that  the  only  practical  (m,  A')BWTs  are  those  with 
m  =  0  (small  condition  numbers)  and  large  K  (large  number  of  vanishing  moments). 


Table  4:  CGNE  iterations  with  tol=le-3  for  (m,/\)BWT  with  l  levels. 


1 

0 

1 

2 

3 

4 

5 

(m,K) 

(0,2) 

16 

18 

22 

20 

22 

22 

(0,3) 

16 

18 

23 

24 

26 

26 

(0,4) 

16 

18 

25 

23 

30 

30 

(0,5) 

16 

18 

26 

25 

33 

* 

(2,2) 

16 

35 

60 

71 

55 

60 

(2,3) 

16 

36 

55 

60 

76 

72 

(4,5) 

16 

99 

122 

73 

85 

93 

(6,7) 

16 

193 

124 

84 

87 

* 

(8,9) 

16 

256 

104 

133 

152 

* 

In  our  last  experiment  we  compare  the  compression  rates  (the  number  of  non-zeros  divided 
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by  N2)  and  the  numbers  of  CGNE  iterations  for  the  circular  cylinder  problem  described 
earlier.  Only  results  for  the  N  —  256  are  given  here.  Note  that  for  larger  N  more  levels  of 
BVVT  transforms  will  have  to  be  used  leading  to  higher  condition  numbers  and  thus  more 
iterations.  We  use  both  (m,  A')BWTs  and  the  OWTs  with  various  numbers  of  vanishing 
moments.  The  number  of  transformation  levels  varies  with  each  method  (depending  on  filter 
lengths).  The  sparsification  of  the  matrix  is  via  a  thresholding  technique  designed  to  keep 
the  relative  errors  within  1%  of  the  actual  solution.  We  included  in  Table  5  only  the  BWTs 
with  m  =  0  in  view  of  the  previous  remarks.  In  fact,  higher  values  of  m  give  rise  to  denser 
matrices.  The  results  suggest  that  the  OWTs  perform  better  than  any  of  the  BWTs. 


Table  5:  Compression  rates  and  CGNE  iterations  with  toI=le-3  for  (m,  K )BWTs  and  OWTs 


method 

BWT 

BWT 

BWT 

OWT 

OWT 

order 

(0,3) 

(0,4) 

(0.5) 

4 

8 

moments 

4 

6 

s 

4 

8 

compression 

22% 

21% 

20% 

15% 

11% 

iterations 

26 

30 

28 

16 

16 

5  Conclusions 

Some  nonorthogonal  wavelet  transforms  have  been  shown  to  produce  higher  compression  of 
matrices  in  computational  electromagnetics  than  the  orthonormal  wavelet  transforms.  This 
paper  studies  the  conditioning  of  B-spline  biorthogonal  wavelet  transforms  in  the  context  of 
their  application  to  computational  electromagnetics.  We  found  that  the  condition  numbers 
of  the  transformed  matrices  increase  considerably  with  the  wavelet  order  m  and  the  number 
of  wavelet  levels.  Even  for  small  number  of  vanishing  moments  m,  for  which  the  growth  of 
condition  numbers  is  not  rapid,  it  directly  affects  iterative  solvers.  In  addition,  the  compres¬ 
sion  rates  obtained  with  the  help  of  the  BWTs  were  worse  than  those  for  the  orthonormal 
wavelet  transforms.  We  conclude  that  the  biorthogonal  wavelet  transforms  should  not  be 
recommended  for  matrix  compression  in  computational  electromagnetics. 

A  more  detailed  study  of  conditioning  of  other  non-orthogonal  wavelet  transforms  including 
optimal  scaling  of  BWTs,  semiorthogonal  wavelet  transforms,  and  wavelet  packets  will  be 
presented  elsewhere. 
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I.  Abstract 

This  paper  presents  a  simple,  yet  powerful  scheme  to  incorporate  Bayliss-Turkel 
Absorbing  Boundary  Conditions  (ABCs)  of  any  order  into  the  finite  element  simula¬ 
tion  of  open-region  radiation  problems.  The  method  is  based  on  using  finite-difference 
approximation  of  the  normal  derivatives  in  the  original  Bayliss-Turkel  ABC  formu¬ 
lation.  This  makes  possible  the  implementation  of  BT  operators  of  order  three  or 
higher,  thus  achieving  levels  of  accuracy  unrealized  previously. 


II.  Introduction 

The  Bayliss-Turkel  (BT)  boundary  operators  theory  states  that  spurious  (non¬ 
physical)  reflections  arising  from  the  mesh-truncation  boundary  can  be  made  ar¬ 
bitrarily  small  by  chosing  an  operator  with  the  proper  order  (in  three-dimensional 
space,  the  spherical  mesh-truncation  boundary  can  be  located  arbitrarily,  whereas 
in  two-dimensional  space,  the  BT  theory  strictly  applies  to  circular  mesh-truncation 
boundaries  located  in  the  asymptotic  region)  [lj.  However,  despite  this  attractive 
feature,  only  first-  and  second-order  operators  were  actually  implemented  in  finite  el¬ 
ement  codes.  The  reason  for  this  had  to  do  with  the  difficulty  of  representing  normal 
derivatives  in  terms  of  tangential  derivatives  for  boundary  operators  of  order  three 
or  higher  [2].  More  explicitly,  let  us  consider  the  homogeneous  Helmholz  equation  in 
two-dimensional  free  space.  Assuming  scattering  by  a  perfectly  conducting  object, 
Galerkin  procedure  leads  to: 

/  VvVu  -  k2vu  dn  =  j  v  ~  dr  +  [  dr  m 

Jet  Jrc  dn  Jr0  dn  v 

where  fl  is  the  solution  space  of  the  problem  and  Tc  and  T0  are  the  conductor  boundary 
and  the  cylindrical  mesh-termination  (outer)  boundary  respectively.  The  unknown 
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field  is  represented  by  u  and  the  testing  function  by  v  (for  more  details  on  the  finite 
element  derivation  of  (1),  see  [3]). 

The  normal  derivative  on  the  right-hand  side  of  (1)  is  unavailable  on  the  outer 
boundary  of  the  finite  element  mesh  and  therefore,  to  complete  the  solution,  this 
normal  derivative  must  be  exchanged  with  the  field  and  its  tangential  derivatives 
on  the  boundary.  Unfortunately,  only  the  first-  and  second-order  BT  operators  can 
be  cast  in  the  format  that  expresses  the  normal  derivative  in  terms  of  tangential 
derivatives.  BT  operators  of  third  or  higher  order  contain  mixed  derivatives. 

Despite  this,  however,  one  can  find  in  the  literature  several  ABC  constructions 
based  on  the  BT  operators  that  extend  beyond  the  second-order  [2].  These  con¬ 
structions,  however,  are  only  approximations  of  higher-order  operators  where  mixed 
derivatives  are  either  assumed  negligible  or  replaced  by  simple  approximations  that 
give  results  with  varying  degrees  of  accuracy.  Unfortunately,  such  approximations, 
for  the  most  part,  are  unsubstantiated  analytically  and  often  result  in  unsatisfactory 
levels  of  accuracy  [2]. 

In  this  work,  instead  of  interchanging  the  normal  derivative  with  tangential  deriva¬ 
tives,  we  maintain  the  original  form  of  the  operator  and  express  (discretize)  higher- 
order  operators  as  a  telescopic  finite- difference  expressions.  The  attractive  features 
of  this  implementation  are:  1)  it  does  not  require  the  use  of  higher-order  finite  ele¬ 
ments,  2)  it  is  remarkably  simple  to  implement,  while  affecting  the  overall  sparsity 
of  the  finite  element  matrix  in  a  negligible  way,  and  3)  it  provides  levels  of  accuracy 
unrealized  before. 

This  paper  presents  the  new  formulation  for  the  scalar  two-dimensional  wave  equa¬ 
tion.  Numerical  experiments  will  be  given  to  demonstrate  the  levels  of  accuracy  that 
can  be  achieved  in  comparison  to  previous  implementations.  Finally,  we  present  the 
extension  of  the  new  formulation  to  the  three-dimensional  scalar  and  vector  wave 
equation. 


III.  Formulation 

The  Nth  order  Bayliss-Turkel  operator  applied  on  a  cylindrical  outer  boundary  is 
expressed  as  [1]: 


2 i  -  3/2 
P 


u  —  0 


(2) 


where  p  is  the  radial  variable  and  k  is  the  wave  number.  When  (2)  is  applied  at  an 
outer  boundary  located  at  a  fixed  radius  p,  the  error  in  the  solution  is  of  the  order 
2Na+1/Sl  where  the  constant  a  depends  on  the  proximity  of  the  outer  boundary  to 
the  scattering  object  and  the  wave  number.  (For  two-dimensional  cylindrical  outer 
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boundaries,  this  accuracy  measure  is  valid  in  the  asymptotic  region  only.) 

Next,  we  transform  (2)  into  a  discrete  operator  using  the  backward  finite-difference 
approximation  for  the  normal  derivative,  vis., 


(i-s-') 

A  p 


(3) 


where  /  is  the  identity  operator,  S'-1  is  the  space  shift  operator  and  A p  is  the  space 
step  in  the  radial  direction.  Substituting  (3)  into  (2),  we  arrive  at  a  finite-difference 
representation  of  the  boundary  operator  in  (2): 


^w=n(<^-!^)=o  (4) 

where  a,  =  1/A p  +  jk  +  (2 i  —  3/2)/ p.  In  (5),  Bd  denotes  a  discrete  operator.  Notice 
that  (4)  is  a  backward  finite-difference  transformation  of  the  operator  in  (2)  (using 
nodes  internal  to  the  boundary).  To  enable  the  numerical  implementation  of  (4), 
the  finite  elements  that  are  adjacent  to  the  outer  boundary  must  be  structured  such 
that  their  nodes  lie  along  mutually  orthogonal  curves  as  shown  in  Fig.  1.  In  light  of 
the  flexibility  in  creating  the  finite  element  mesh,  such  constraint  does  not  pose  any 
difficulty. 

The  transformation  of  the  boundary  operator  into  a  telescoping  finite  difference 
expression  as  in  (4)  was  successfully  used  by  Higdon  to  implement  the  absorbing 
boundary  operators  that  he  developed  for  the  finite- difference  time-domain  solution 
of  the  wave  equation  [4].  It  is  worth  mentioning  here  that  Higdon’s  operators  axe  a 
special  class  of  the  BT  operators  (which  can  realized  when  p  ->■  oo  and  completing 
the  time-domain  to  frequency-domain  transformation,  i.e.,  dt  — v  k). 

IV.  Implementation  into  Finite  Element  Simulation 


The  implementation  of  (4)  into  a  finite  element  formulation  is  carried  out  at  the 
numerical  level  by  direct  substitution  into  the  finite  element  matrix.  For  brevity,  we 
skip  the  finite  element  formulation  and  start  with  the  matrix  equation.  Let  us  denote 
the  boundary  nodes  that  lie  on  the  outermost  boundary  surface  as  TJ0,  the  nodes 
that  are  adjacent  to  the  outermost  surface  and  are  involved  in  the  description  of  the 
boundary  operator  as  U&,  and  the  remaining  nodes  as  U;.  Enforcing  (4)  at  each  of 
the  outer  boundary  nodes  gives  the  following  algebraic  equation: 


U0  =  BXJi 


(5) 


The  finite  element  matrix  can  be  represented  symbolically  as 
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(6) 


'  Mo0  Mot 

'u0‘ 

'F0  ‘ 

Mto  MW  Mi,- 

Ut 

= 

Ft 

M,t  Mu  _ 

u. 

F, 

The  vectors  F0,  Fb  and  F,  are  related  to  the  excitation  and  all  boundary  conditions. 
Finally,  we  substitute  (5)  into  (6)  to  obtain  the  reduced  system  matrix: 


MtoB  +  Mfcj, 

M«' 

'u6' 

Ft 

M*t 

Mi, 

Ui 

.  Fi. 

We  note  that  the  fact  that  the  ro  boundary  integral  was  not  implemented  numer¬ 
ically  does  not  introduce  any  deficiency  in  our  new  implementation.  The  matrix 
boundary  condition  substitution  of  (5)  into  (6)  constitutes  a  constraint  on  the  nodal 
fields  such  that  they  admit  outgoing  waves  only.  With  such  substitution,  the  finite 
element  matrix  system  becomes  well  posed  in  the  sense  that  a  unique  solution  is 
admissible.  This  solution  satisfies  the  wave  equation  in  any  unconstrained  portion 
of  the  domain,  satisfies  any  physical  boundary  conditions  (as  would  be  dictated,  for 
instance,  by  perfectly  conducting  objects),  and  admits  outgoing  waves  on  the  outer 
mesh  boundary.  For  completeness,  one  can  still  assume  that  the  boundary  line  inte¬ 
gral  in  (1)  is  applied  at  a  circular  outer  boundary  that  is  positioned  external  to  the 
boundary  where  (4)  is  applied.  However,  the  application  of  (4)  conceals  any  effect 
this  line  integral  might  have  on  the  system  matrix. 

V.  Numerical  Experiment 

To  test  the  effectiveness  of  this  new  and  highly  simple  scheme,  we  study  the  classical 
problem  of  plane  wave  scattering  from  perfectly  conducting  circular  cylinder.  We 
consider  a  cylinders  of  radius  8A.  The  mesh  for  this  problem  consists  of  500  uniformly 
spaced  nodes  in  the  angular  direction  and  6  uniformly  spaced  nodes  in  the  radial 
direction  with  A p  —  0.03A.  Thus,  the  BT  operators  are  applied  at  a  distance  of  0.15A 
from  the  cylinder’s  surface.  We  present  results  for  the  H-polarized  case  only  since  it 
has  been  shown  that  this  polarization  is  significantly  more  susciptible  to  ABCs  than 
the  E-polarized  case  [2]. 

We  apply  the  second-  and  fourth-order  BT  operators  using  the  new  implementation. 
In  Figs.  2  and  3,  we  show  calculation  of  the  surface  current  and  Radar  Cross  Section 
(RCS).  For  this  cylinder,  there  is  a  sizable  number  of  harmonics  containing  evanescent 
energy  and  thus  one  expects  the  second-order  BT  operators  to  yield  high  errors  as  was 
previously  confirmed  in  [2].  The  results  obtained  using  the  fourth-order  BT  operator 
show  a  significant  enhancement  in  accuracy  careful  observation  of  Fig.  3  reveals  that 
the  BT4  operator  results  in  a  maximum  error  in  RCS  of  only  1.3  dB. 
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VI.  Extension  to  the  scalar  and  vector  three-dimensional  wave 

EQUATION 

The  finite  element  solution  of  the  vector  Helmholtz  equation  is  more  difficult  than 
that  of  the  scalar  one.  Absorbing  boundary  conditions  (ABCs)  that  were  developed 
earlier  for  the  vector  wave  equation  were  complex  [2], [5], [6].  In  fact,  the  numerical 
implementations  of  second-order  ABCs  were  the  only  that  were  reported  in  the  litera¬ 
ture  because  ABCs  of  order  three  or  higher  were  immensely  difficult  to  implement  in  a 
finite  element  numerical  code.  Second-order  operators  were  found  to  yield  satisfactory 
solutions  only  when  the  outer  boundary  was  positioned  one  or  more  wavelengths  away 
from  the  scatterer,  thus  increasing  the  computational  cost  substantially.  Absorptive 
material,  on  the  other  hand,  such  as  the  perfectly  matched  layer  (PML),  promises 
perfect  matching  in  the  analytic  domain  only,  and  the  performance  when  incorpo¬ 
rated  into  a  discrete  system  might  not  be  very  satisfactory.  The  development  of  PML 
for  three-dimensional  spherical  coordinates  has  recently  been  reported  in  [7]- [9]. 

Here,  we  develop  a  series  of  simple  operators  for  the  finite  element  solution  of  the 
three-dimensional  vector  wave  equation.  Unlike  the  methodologies  adopted  earlier, 
namely,  that  of  developing  operators  by  manipulating  the  vector  field,  and  thus  ob¬ 
taining  boundary  conditions  that  involve  the  vector  field  itself,  we  develop  operators 
that  can  be  applied  on  the  scalar  field  components  of  the  vector  field. 

VII.  Boundary  Operators  for  Spherical  Boundaries 

According  to  the  expansion  theorem  of  Wilcox  [10],  a  vector  field  E  in  three- 
dimensional  space  can  be  represented  by  a  uniformly  convergent  series  in  powers 
of  1/r: 


EM,*) 


en(M 


(8) 


In  (8)  the  vectors  en  are  determined  recursively  (see  [10]  for  a  thorough  exposition  of 
the  Wilcox  expansion  theorem  and  related  corollaries).  From  (8),  each  of  the  scalar 
field  components  of  the  vector  field  can  be  expressed  as: 


EaM,*) 


e-jkr 
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71=0 


en  M) 


(9) 


where  a  =  <f>,0,r.  The  representation  of  the  scalar  field  components  as  in  (9)  is  sig¬ 
nificant  because  it  shows  that  these  scalar  spherical  components,  despite  the  fact  that 
they  are  not  scalar  radiation  functions,  can  still  be  represented  in  a  series  of  powers 
1/r.  The  fact  that  these  series  are  strictly  not  a  Wilcox  expansion  is  inconsequential 
to  our  goal  here  of  developing  absorbing  boundary  operators. 
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The  series  in  (9)  possess  all  the  convergence  properties  of  the  Wilcox  expansion, 
and  furthermore,  these  series  can  be  differentiated  term  by  term  with  respect  to  r. 
Therefore,  we  can  conveniently  use  the  procedure  developed  by  Bayliss  and  Turkel 
[1]  to  show  that  Ea  satisfies  the  Nth  order  operator 

B  =  n(ar+j*  +  — )  (10) 

»=1  T 

up  to  a  residual  of  0(l/r2N+1). 

For  the  numerical  solution  of  an  open-region  finite  element  problem,  only  the  tan¬ 
gential  field  components  need  to  be  evaluated  on  the  boundary  of  the  domain,  there¬ 
fore,  as  a  boundary  condition  for  the  vector  wave  problem,  we  write 

*(£)-n  <^*+^>U)  =  °  (11) 

The  family  of  operators  in  (11)  involve  radial  derivatives  only,  and  since  the  series 
in  (9)  is  uniformly  convergent  in  0 ,  <j>  and  r,  for  any  fixed  r,  the  operators  in  (11)  can 
be  implemented  to  yield  any  arbitrary  accuracy  desired  (i.e.,  increased  accuracy  with 
increased  order).  (Of  course,  in  numerical  implementations,  discretization  artifacts 
put  a  limit  on  the  accuracy  that  is  predicted  by  theory.)  Additionally,  the  well 
posedness  of  (5)  is  assured  since  it  follows  from  the  the  Bayliss-Turkel  theory  [1],[11]. 

The  numerical  implementation  of  (8)  and  (9)  is  identical  to  the  procedure  outlined 
above  for  the  two-dimensional  wave  equation,  except  for  the  analogous  extension  of 
two-dimensional  cylindrical  geometry  to  three-dimensional  spherical  geometry  (for 
instance,  the  nodes  of  the  finite  elements  adjacent  to  the  spherical  outer  boundary 
must  lie  on  mutually  orthogonal  planes.) 

VIII.  Summary 

This  paper  presented  an  exact  implementation  of  BT  operators  of  any  order  in  a 
finite  element  scheme.  This  makes  possible  the  implementation  of  BT  operators  of 
order  three  or  higher,  thus  achieving  levels  of  accuracy  unrealized  previously.  The 
numerical  implementation  of  this  scheme  is  simple  requiring  algebraic  manipulations 
only.  The  resultant  matrix  equation’s  sparsity  is  reduced  in  a  negligible  manner. 
However,  since  the  boundary  nodes  are  eliminated  from  the  system  matrix,  any  in¬ 
cremental  increase  in  computational  cost  due  to  the  change  in  sparsity  is  offset  by 
solving  a  smaller  matrix. 
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Fig.  1.  Outer  boundary  finite  element  nodes  lying 
along  mutually  orthogonal  curves 
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Fig.  2.  Surface  current  (H-polarization)  calculated  using  second-order  BT 
(BT2),  fourth-order  BT  (BT4)  and  the  analytic  (Exact)  solution  for  an 
8X  perfectly  conducting  cylinder. 


Fig.  3.  RCS  for  the  problem  of  H-polarized  scattering  from  an  8X. 
perfectly  conducting  cylinder  calculated  using  second-order  BT  (BT2), 
fourth-order  BT  (BT4)  and  the  analytic  (Exact)  solution. 
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1.  Summary 

In  a  1997  paper  [1]  the  author  presented  formulas  for  total  energy  and  time-average  power  radiated 
from  charge-current  distributions.  The  expression  Wrad(T),  identified  as  total  energy  radiated  during  a  time 
interval  T,  was  actually  the  total  energy  which  flowed  in  from  the  generator  field,  which  was  presumed  to 
support  the  electric  field  induced  by  the  flow  (if  not,  the  flow  would  change  so  its  electric  field  as  the  sum  of 
near-field  and  radiation-field  components  would  match  the  generator  electric  field).  It  was  implied  that  the 
flow  was  zero  at  the  beginning  and  end  of  interval  T. 

The  integrand  of  the  time  integral  for  Wrad{T )  could  not  directly  be  interpreted  as  instantaneous  power 
flow;  various  time  integrations  by  parts  were  necessary  to  identify  the  primary  components  of  instantaneous 
power  flow. 

When  the  expression  for  Wroj(T)  was  applied  to  an  accelerating  electron  it  was  found  to  contain  primary 
terms  representing  the  increase  in  kinetic  energy  plus  the  radiated  energy  during  T. 

In  the  present  paper  the  author  presents  a  double  spatial  integral  over  charges  and  currents  which  does 
represent  instantaneous  power  flow  in  from  the  generator  field.  Evaluation  of  its  predominant  terms  for  the 
accelerating  electron  of  [1]  yields  the  instantaneous  time  rate  of  increase  of  kinetic  energy  plus  the  power 
radiated.  Because  this  expression,  Eq.  (11),  includes  the  kinetic  energy  term  it  is  labeled  P7 „(<),  the  power 
tn  from  the  generator  field  and  delivered  to  the  charges  and  currents. 

The  time-average  value  of  Pin(t)  at  a  single  frequency  agrees  with  that  formula  in  [1]. 

2.  Derivation  of  the  Formula  for  Instantaneous  Power  flow. 


This  formula  will  be  derived  straightforwardly  from  the  scalar  and  vector  potentials  <j>  and  A  from  [1], 
They  are 

4«"*“C ■*) = £  (!  +  =?£  w 

4«A.(r,t)  =  i  £  +  ~  •<&,  =  Jr  £  (2) 

with  the  definitions 


F\  =  J(r',t'  =  t-R/c),  [q]  =  q(r',t')7  X  =  R  =  (r-r>),  R=\R\  (3) 

One  can  verify  directly  that  these  yield  the  near  fields  (subscript  ”ne”)  and  radiation  fields  (”ra”)  according 
to  the  prescriptions 


Ene  =  —  V^ne  -  Ane,  Era  =  -V<f>ra  -  Ar, 


(4) 


4  =  Vx  Ane,  Hra  =  V  x  Ara, 

where  the  Panofsky-Phillips’  expressions  [2]  for  these  field  components,  reproduced  from  [1],  are 


(5) 
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(6) 


E(r,t)  =  Ent  +  ETa, 

Ene=j-i 

4jreo  7s,  #3  c  ./s„  ^ 

Erg  =  1  -  l  X3  X-~^, 

4xeoc2  Js„  E3 


The  associated  magnetic  field  is 


ff(r,t)  -  +  Hr 


n<  45r/5o  ii3  4irc/Sa  i22 

The  instantaneous  power  flowing  into  a  charge-current  distribution  on  surfaces  from  the  generator  field 


Pjn(t)  =  —  S  ( Ene  +  Era)t  •  J{t)dS 


Substitution  of  (4)  and  integration  of  the  V<£  •  J-terms  by  spatial  parts  yields 

Pin(t)  =  JL  jf  (^ne9  +  7*  X,e  +  +7*  Xa)^5  (10) 

Now,  upon  sutstituting  the  potential  expressions  (1)  and  (2)  we  arrive  at 

gjf  <<Sj£  JS'4{7(f,i)*^(f',i'=l-|)  +  c«(f',i')^9(i'.<) 

+^,()cZ£^+ZMiBi(f.,0c]}  (id 

The  bracketed  expression  in  (11)  evaluates  to  zero  when  integrated  over  either  Sa  or  S'a,  (or  over  V  or  V'  in 
the  volume  representation)  as  appropriate  (See  App.  A). 

For  charges  moving  in  one  direction  the  terms  for  t'  sa  t  agree  with  Miller’s  measure  [3]  of  the  instan¬ 
taneous  power  leaving  current  and  charge  traveling  on  a  straight-wire  dipole  as  a  stun  of  the  time  rate  of 
change  of  a  current  energy  W/(t)  =  /  I{s,t)2dS  and  a  charge  energy  WQ(t)  =  J  Q(s,t )  dS. 

The  remainder  of  this  paper  will  evaluate  (11)  converted  to  volumetric  form  for  the  accelerating  electron 
of  [1],  to  derive  the  formula  for  instantaneous  power  radiated  oc  (v)2. 

3.  Evaluation  of  (11)  for  an  Electron  Accelerating  in  the  z-direction. 

For  the  electron  moving  in  the  z-direction,  p  designating  a  transverse  position,  we  can  write  very 
generally 

q(r,t)  =  g(p,z,t)  =  g(p,z- J  v(r)dr\  j^gdV  -  -e  ,  (12) 


J(r,t )  =  J(p,z,t)  =  azgv(t) 
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V.J  =  !£  =  g'v(t)  =  -i 


(14) 


d 

d(arg) 

Now  expand  qi^^t'  =  t  —  R/c )  about  t: 

=  t- R/c)=q(r,,t)~  +  +  -  (15) 

and  similarly  J(f’,t').  Then  P{n  for  volumetric  charges  and  currents  (J(amp/m)dS  — ►  J(amp/m2)dV, 
q(cottl/m2)dS  —*■  q(coui / mz)dV)  reads 

«»(«)  =  4iJdvJ  wjifanT.t)  .s.f/V.i)-  jfc.t)  +  f  (?)  V.<>  - 

+i(f.0'J[5(i‘'.<)-??'(>M)  +  |(?)  -  |(?)  ip',0  +  ""]}  (16) 

We  have  ignored  the  last  bracket  in  (11)  for  the  reason  given  earlier. 

4.  Evaluation  of  the  JJ-terms  in  Eq.  (16). 

4.1  the  J(r,  f)  Jjr1,  t )  —  term. 


For  an  electron  of  uniform  charge  density  this  term  evaluates  [1]  to 

py  =  §-t(lmkev2)  =  Wke,  (17) 

the  rate  of  change  of  kinetic  energy.  The  kinetic  mass  evaluates  to 

mke  —  rr~  ,  rn  =  electron  radius,  (18) 

IOjt  r0  x 

which  is  double  the  classical  mass  mcj  obtained  by  equating  the  electrostatic  energy  required  to  build  the 
electron  to  mejc2 

4.2  TheJ(r,t)J(rr,t)  —  term  in  Eq.  (16) 

Expanding 

Jir'it)  -  qif,t)v(t)  +  2 qv(t)  +  qv  (19) 

we  have,  for  this  term, 

P2  =  J dV  J  dV'q(r,  t)v(t)  ^r',  t)  +  2 qv  +  guj  (20) 

The  q-  and  9-terms  evaluate  to  zero  when  integrated  over  V'.  Introducing  vv  =  d(vv)/dt  —  (t5)2  we 
obtain 


P2~~1^2W{v)  +  1^{v) 


(21) 


The  first  term  in  P2  is  proportional  to  a  second  time  derivative  of  kinetic  energy.  Since  this  term  integrates 
to  zero  over  the  total  time  of  the  flow  we  regard  it  as  a  fluctuation  in  the  kinetic  energy.  The  second  term 
is  3/2  of  the  instantaneous  power  radiated  by  the  electron. 
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4.3  The  J(r,t)J¥,t)  -  term  in  Eg.  (16) 

If  we  add  to  this  term  the  term  with  r  and  ¥  interchanged,  divide  by  2,  and  replace  these  two  terms 
with  the  expansion  of 

^[j(r,t)J(¥,t)  +  J(¥,t)J(r,t)] 
we  obtain  for  the  /(F.tJJ^.fJ-term  in  (16), 

J3  =  i6 ™JdvJ 

The  term  within  the  first  set  of  brackets  evaluates  to  <  \tf§i  of  P2.  Here  r0/c  «  10_23sec.  The  last  bracket 
yields  terms  small  compared  to  Wke,  Wke  and  the  time  derivative  of  the  last  or  radiation  term  in  (21). 

5.  Evaluation  of  the  q$- terms  in  Eq.  (16). 

5.1  The  q(r,t)q(¥  ,t')  -  term 

Addition  to  this  term  of  the  term  with  r  and  ¥  interchanged  and  division  by  2  yields  a  double  integral 
proportional  to 

ft  f  dV  f  dV'^q(r,i)q(r',t).  (22) 

The  value  of  the  double  integral  is  independent  of  the  position  of  the  electron,  hence  its  time  derivative  is 
zero. 

5.2  The  q{f ,  t)q(¥,  t)  -  term  in  Eg.  (16) 

Using  (14)  this  double  integral  is  of  the  form 

P3  =  J<V  (23) 

We  can  integrate  each  J-derivative  separately  through  the  entire  electron  sphere  and  obtain  zero  for  P3. 

5.3  The  q(r,t)q(T',t)  —  term  in  (16) 

As  in  Sec.  5.1  we  can  interchange  r  and  ¥  in  the  integrand,  add  it  to  the  integrand  above  and  divide 
by  2.  The  resultant  integral  is  proportional  to 

P*~FtJdV  f  dV,R*(T,  tW,  0  =  0  (24) 

when  we  substitute  (14)  for  each  q  and  note  the  value  of  the  double  integral  is  independent  of  t. 

5.4  The  q(f,t)q[¥,t)  —  term  in  Eq.  (16) 

This  term  is 

P*  =  J *V  J dV'R2q(r, OftF'.O.  R2  =  \P  ~  pf  +  (*  “  z>?  (25) 
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With  the  identity 


=  ?(r,  + 

and  introducing  J(f,t)  and  Jir'.t)  via  (14)  we  change  P5  to  read 

P5  =  ~4^6'cJdVJdVR{dir~d'z  d?~\~  dz  dz>  /• 

In  [1,  (25)-(28)]  we  evaluated  the  second  term  of  P5  as 

l  fi0e2 


3  47TC 


(v) 


(See  App.  B).  By  analogy,  the  first  term  of  P5  evaluates  to 


(26) 


(27) 


(28) 


(29) 


6.  Summary  Evaluation  of  the  Predominant  Terms  in  Ptrl(<)  of  (16) 

The  sum  of  terms  Pi  through  P5  is 

Pin(t)  =  wke(t)  -  | ^Wke(t)  +  | (30) 

The  last  term  is  the  correct  expression  for  instantaneous  power  radiated  by  the  electron.  The  first  term  is 
the  rate  of  increase  in  kinetic  energy;  the  unexpected  second  term  is  regarded  as  a  fluctuation  in  the  kinetic 
energy  since  it  integrates  to  zero  over  the  duration  of  the  flow  ,  In  words,  (30)  says 


Power  from  generator  field  =  P,-n(t) 

=  Rate  of  increase  in  kinetic  energy,  with  a  fluctuation  term, 

+  Power  radiated.  (31) 

7.  The  Sinusoidal  Steady  State,  Time-average  Power  Radiated. 

For  sinusoidal  steady  state  we  integrate  (11)  without  its  last  bracketed  terms  over  time  T  =  2 ir/u,  and 
substitute 

J(r,  t)  =  i[J(r)e,"t  +  complex  conjugate)  —  Real  [J(r)eJk,f]  etc.  (32) 

into  (11)  for  current  flow  on  surfaces.  There  is  no  time  average  change  in  kinetic  energy  of  the  charged 
particles  so  we  obtain,  for  this  type  of  flow, 

Pin(w)  =  Prad(w)  =  -±Realj>s  t(r)*J(r)dS 

=  jf  jf  dSdS'  [j'(r)  .  J{¥)  -  c2f  (r)^(r')]  (33) 

This  is  the  same  expression  quoted  in  [1,  (31)],  from  which  the  correct  radiation  resistance  was  derived 
for  a  straight  dipole  with  a  triangular  distribution  of  sinusoidal  current  and  for  a  circular  loop  carrying  a 
spatially  uniform  sinusoidally  varying  current. 
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7.  Conclusions  from  These  Results 

The  general  Eq.  (11)  is  the  result  of  rigorous  radiation  mathematics  but  does  not  explain  fully  the 
radiation  physics.  Radiation  is  ordinarily  ascribed  to  charge  acceleration ,  but  there  is  no  acceleration  term  in 
q{r,i)q{r,  tj.  Eq.  (11)  only  says  that  we  can  evaluate  the  power  flowing  into  the  charge-current  distribution 
by  integrating  over  space  the  “coupling”  between  current-current  pairs  and  charge-charge  pairs. 

The  physics  of  radiation  are  even  more  obscure  in  Eq.  (33),  where  there  is  no  obvious  acceleration  term. 
It  merely  says  that  we  can  evaluate  the  time-average  power  by  integrating  over  current  pairs  and  charge 

pairs,  each  pair  multiplied  by  the  Green  function. 

However,  we  can  condude-from  the  application  of  the  volumetric  form  of  (11)  to  the  electron  model  that 
Pin(t)  represents  first-order  rate  of  increase  in  kinetic  energy  of  the  charged  particles  plus  radiated  power. 
The  first  and  second  time  derivatives  of  kinetic  energy  can  be  calculated  from  the  kinematics  of  the  charged 
particles  themselves. 
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Appendix  A.  Proof  that  the  Bracketed  Expression  in  Eq.  (11)  is  zero. 

We  label  the  two  terms  T1  and  T2,  respectively.  Upon  substituting  V  •/  =  -<$  for  q(r,  t )  and  similarly 
for  q{r',t)  and  neglecting  the  common  c-factor,  we  write  the  identity 

V.  heJjmr*}  =Tl+7(r,i).vF^^-')+p',+7-^,<')(^1  HI) 


&  j—  L  R2  j 

We  expand  the  last  term  in  cartesian  coordinates.  Then  we  write  the  corresponding  expansion  for 


which  equals  T2  plus  a  term  like  the  last  J  •  V-term  in  (Al). 

Upon  adding  these  two  identities,  (Al)  and  the  corresponding  one,  we  find  that  the  expansion  of  the 
last  term  in  (Al)  cancels  the  corresponding  expansion  of  the  last  term  in  (A2),  and  we  obtam 


7(r,t)m M+V'.  [W&jtsj-n+n 


v.  [j(r,0 


The  divergent  terms  can  be  integrated  over  Sa  or  S',  or  over  V  or  V',  respectively,  of  the  full  charge-current 
.  distribution  to  yield  zero. 

Therefore  the  bracketed  expression  m(ll),Tl  +  T2,  is  zero.  QED. 

Appendix  B.  Evaluation  of  the  second  term  in  (27). 
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The  | p  —  p^-term  yields  zero;  only  the  —2 zz'  within  the  brackets  yields  a  finite  answer.  Integrating  by 
spatial  parts  over  z,z',  we  obtain 

n  =  J dp  JdjfJdzJ dz'J(p,z,t)jtf,  z\t)  (B2) 

With 


*  dJ 

J(r,t)  =  qv+qv  =  +  «(?,<)« 

OZ 

and  similarly  for  J(y,t),  we  see  the  ^ju-term  integrates  over  z  to  zero.  The  final  result  is  (28). 


m 
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1.0  Abstract 

This  paper  describes  recent  advances  in  the  use  of  computational  electromagnetics  (CEM)  for  use 
by  the  radar  cross  section  (RCS)  measurement  community  to  design  improved  calibration  stan¬ 
dards.  We  show  how  uncertainties  in  computed  calibration  data  directly  impact  measured  data. 
We  present  numerical  examples  for  two  calibration  targets,  a  right  circular  cylinder  and  a  comer 
reflector,  demonstrating  the  sensitivity  of  the  computations  to  several  CEM  parameters.  We  fur¬ 
ther  show  that  these  sensitivities  are  endemic  to  a  wide  class  of  integral  equation-based  CEM 
codes. 


2.0  Introduction 

Radar  cross  section  (RCS)  measurement  ranges  are  complex  systems  used  to  measure  the  RCS  of 
arbitrary  targets.  Recent  advances  in  instrumentation  radar  technology  have  greatly  improved  the 
accuracy  of  measured  RCS  data,  as  well  as  the  speed  with  which  it  is  acquired.  New  sophisticated 
software  tools  transform  broadband  and/or  wide-angle  RCS  data  into  various  types  of  images, 
useful  for  analyzing  target  scattering  characteristics.  Various  aircraft  components  have  maximum 
RCS  requirements  levied  upon  them,  so  accurate  RCS  measurements  are  needed  to  verify  compli¬ 
ance.  This  has  resulted  in  an  increased  emphasis  on  quantifying  sources  of  uncertainty  in  the  RCS 
measurement  process. 

One  aspect  of  the  RCS  measurement  process  which  has  received  relatively  little  attention  is  the 
calibration  process.  Calibration  is  accomplished  by  measuring  the  electric  field  scattered  by  the 
target  under  test,  e£T,  and  then  repeating  the  measurement  using  a  calibration  standard,  e£T. 

The  result  is  then  correlated  with  the  known  scattered  field  of  the  calibration  standard,  Eca]  ,  at 
the  frequencies,  polarizations,  and  aspect  angles  of  interest,  to  compute  the  normalized  scattered 

electric  field  of  the  target,  e^“  ,  according  to 


_>scat 

Etgt 


(CO,  6)  = 


-i.meas 

Etgt 


(CO,  0) 


-*.meas 

ECal 


(co,  0) 


^ exact 
’  ECal 


(CO,  0) 


(EQ1) 
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Equation  1  shows  that  uncertainties  in  the  exact  scattered  field  of  the  calibration  standard  estab¬ 
lish  an  absolute  lower  bound  for  the  uncertainty  of  the  overall  measurement. 

Traditionally,  a  conducting  sphere  is  the  target  which  has  been  used  most  often  for  calibration, 
primarily  because  it  is  one  of  the  few  objects  whose  theoretical  RCS  can  be  computed  exactly. 
Conversely,  other  targets  have  not  been  used  for  calibration  purposes  because  of  the  lack  of  high- 
quality  theoretical  scattering  data.  The  emergence  of  powerful  new  computational  electromagnet¬ 
ics  (CEM)  tools  can  remove  this  barrier  [1]. 

Modem  CEM  software  provides  a  means  to  generate  highly  accurate  theoretical  scattering  data 
for  arbitrarily  shaped  objects,  especially  those  based  on  the  frequency-domain  surface  integral 
equation  formulation.  These  codes  discretize  the  scatterer  surface  and  then  solve  the  integral 
equation  to  estimate  the  electric  current  density  on  the  discretized  surface.  Finally,  the  estimated 
currents  radiate  the  far-zone  scattered  fields.  Because  of  the  numerical  nature  of  the  codes,  the 
computed  scattered  fields  are  not  exact,  but  rather  approximations  subject  to  uncertainties.  The 
two  primary  areas  in  which  uncertainties  arise  are  the  surface  discretization  and  the  integral  equa¬ 
tion  solution. 

Our  research  indicates  that,  far  from  being  turn-key  systems,  modem  CEM  tools  must  be  used 
carefully  to  generate  sufficiently  accurate  scattering  data  from  even  simple  shapes.  Our  accuracy 
requirements  are  driven  by  the  measurement  calibration  process  defined  above,  and  is  probably 
more  stringent  than  many  other  requirements  --  we  typically  require  calculated  data  to  be  within 
0. 1  -0.25  dB  of  the  true  value.  Specifically,  we  have  noticed  considerable  sensitivity  to  two  factors: 
discretization  density  and  the  Combined  Field  Integral  Equation  (CFIE)  parameter  a.  We  demon¬ 
strate  this  for  two  cases:  a  right  circular  cylinder  and  a  comer  reflector. 


3.0  “Squat”  Cylinder 

The  Air  Force  Research  Laboratory  Advanced  Compact  Range  (ACR)  has  adopted  a  family  of 
“squat”  cylinders  as  its  primary  calibration  standard.  The  “squat”  cylinder  is  oriented  edge-on,  so 
that  its  RCS  is  invariant  with  azimuth.  Two  mature  body  of  revolution  (BOR)  codes  have  been 
used  to  generate  theoretical  data  for  the  cylinders:  JRMBOR  [4]  and  CICERO  [3].  Both  of  these 
codes  are  widely  used  throughout  government  and  industry.  Figure  1  shows  the  RCS  as  a  function 
of  frequency  for  the  “450”  cylinder  (4.5  inches  in  diameter,  2.1  inches  in  height). 

Several  comments  can  be  made  regarding  the  results  plotted  in  Figure  1 .  First,  note  the  large  (~0.2 
dB)  discontinuity  in  the  CICERO  results  at  14  GHz,  HH  polarization.  This  is  caused  by  using  a 
single  generating-arc  discretization  for  frequencies  10-14  GHz,  and  another  for  14-18  GHz.  Both 
discretizations  were  intended  to  be  sufficient  for  their  respective  frequency  bands,  but  clearly 
there  is  a  significant  jump  associated  with  transitioning  between  such  discretizations.  The  JRM¬ 
BOR  generating-arc  discretization  was  adaptively  generated  at  each  frequency,  so  no  abrupt 
jumps  in  the  JRMBOR  results  can  be  attributed  to  this  cause.  (Interestingly,  an  exponentially- 
tapered  discretization  was  attempted  in  an  effort  to  more  finely  sample  the  current  density  at  the 
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FIGURE  1. 450  “Squat”  Cylinder  Computed  RCS  vs.  Frequency 
comers  of  the  cylinder;  unfortunately,  this  approach  led  to  numerical  instabilities  in  the  results, 
especially  for  electrically  larger  cylinders.) 


Other,  more  subtle,  discontinuities,  are  evident  in  the  Figure  1  data,  and  these  have  been  deter¬ 
mined  [2]  to  be  caused  by  discontinuities  in  the  number  of  Fourier  modes  included  in  the  solution. 
Both  BOR  codes  adaptively  choose  the  number  of  Fourier  modes  based  on  the  electrical  girth  of 
the  target,  so  that  at  higher  frequencies  more  Fourier  modes  are  included  in  the  solution.  However, 
this  causes  jumps  in  the  RCS  data  when  each  additional  Fourier  mode  is  included.  When  the  num¬ 
ber  of  Fourier  modes  is  set  to  a  predetermined  number  in  the  BOR  code’s  input  file,  the  disconti¬ 
nuities  vanish. 


4.0  Comer  Reflector 

The  trihedral,  or  comer  reflector,  is  another  object 
that  is  used  for  RCS  calibration  purposes.  Its 
advantages  include  ease  of  fabrication,  large  RCS 
relative  to  physical  size,  and  broad  monostatic  scat¬ 
tering  pattern  at  aspects  near  the  maximum  RCS 
orientation.  The  trihedral  is  shown  in  Figure  2.  It’s 
faces  are  each  three  inches  square,  and  the  plate 
thickness  is  0.375  inch.  The  edges  are  beveled  to 
45  degrees.  The  trihedral  is  oriented  such  that  it 
presents  its  maximum  RCS  along  the  positive  x 
axis  and  one  of  its  edges  lies  in  the  xz  plane.  In  this 
orientation,  the  vertically  and  horizontally  polar¬ 
ized  RCS  values  are  very  nearly  identical. 


FIGURE  2.  Discretized  trihedral  geometry 
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Frequency  (GHz)  Frequency  (GHz) 

FIGURE  3.  Calculated  trihedral  RCS  results  for  Codes  A,  B,  and  C  (horizontal  polarization,  maximum 
RCS  aspect,  10  nodes  per  wavelength,  a  =  0.5). 


The  trihedral  is  not  a  BOR,  so  it  cannot  be  analyzed  using  the  BOR  codes  discussed  earlier. 
Instead,  general  purpose  three-dimensional  codes  must  be  used.  We  have  chosen  three  state-of- 
the-art  integral  equation  codes,  each  with  slightly  different  features.  The  first  code  uses  an  FFT- 
based  method  to  iteratively  solve  a  generalized  form  of  surface  integral  equation.  The  second  code 
uses  the  fast  multipole  method  (FMM)  to  iteratively  solve  the  combined  field  integral  equation 
(CFIE).  The  third  code  solves  the  CFIE  directly,  using  LU  decomposition.  We  will  denote  these 
Codes  A,  B,  and  C,  in  no  particular  order.  All  three  codes  work  from  the  same  triangular  facet 
model  of  the  scatterer.  (For  the  trihedral,  the  facet  representation  exactly  captures  the  true  geome¬ 
try,  but  this  is  not  true  in  general  for  curved  surfaces.) 

We  expected  Codes  A,  B,  and  C  to  provide  nearly  identical  RCS  results  for  the  trihedral  for  the 
same  input  parameters,  but  instead  initally  generated  results  which  varied  by  an  amount  greater 
than  we  were  willing  to  accept.  Figure  3  shows  the  calculated  trihedral  RCS,  horizontal  polariza¬ 
tion,  as  a  function  of  frequency.  We  see  that  Codes  A  and  C  (EFIE)  agree  very  well,  to  within  a 
few  hundredths  of  a  dB  across  the  frequency  band,  but  Code  B  (EFIE)  differs  from  these  two  by 
as  much  as  nearly  0.2  dB.  Since  we  are  interested  in  driving  measurement  uncertainty  as  low  as 
possible,  we  adopt  0.1  dB  as  the  uncertainty  threshold  for  the  computed  trihedral  RCS.  We  dis¬ 
card  the  results  from  Code  B  since  its  results  are  not  consistent  with  those  of  the  other  two  codes 
and  thus  we  cannot  consider  it  acceptable  for  our  purposes  (see  Figure  3).  In  the  course  of  running 
the  computations,  we  discovered  that  the  Code  C  results  were  very  sensitive  to  the  way  in  which 
the  singular  kernel  of  the  MFIE  was  handled  in  the  integrations.  Failure  to  adequately  extract  this 
singularity  produced  results  in  error  by  several  dB.  Perhaps  more  noteworthy  is  the  failure  of  the 
CFIE  formulation  to  provide  the  correct  results.  The  CFIE  results  for  Codes  B  and  C  are  errone¬ 
ous  by  0.3-0.7  dB  across  the  frequency  band.  These  results  were  generated  using  a  CFIE  alpha 
parameter  of  a  =  0.5,  such  that  the  EFIE  and  MFIE  are  weighted  equally. 
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Code  B  Uncertainty 
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FIGURE  4.  Computed  trihedral  RCS  error  for  Codes  B  (left)  and  C  (right).  Results  are  for  horizontal 
polarization,  3.2  GHz  frequency,  maximal  RCS  aspect 


A  well-known  “rule  of  thumb”  for  CFIE  codes  is  to  set  a  =  0.5  and  use  a  grid  density  of  10  to  15 
nodes  per  wavelength.  A  parametric  study  was  conducted  to  better  understand  the  role  of  grid 
density  and  a  in  the  CFIE  results.  The  frequency  was  fixed  at  3.2  GHz  and  aspect  was  fixed  along 
the  positive  x  axis.  Code  A  is  not  based  on  the  classical  CFIE  formulation,  so  variation  with  a  was 
not  applicable.  However,  Code  A  results  varied  only  within  0.03  dB  for  grid  densities  from  7  to  30 
nodes  per  wavelength.  These  results  are  shown  in  Figure  4.  Code  B  was  ran  for  grid  densities  of  7, 
10  15  20,  and  30  nodes  per  wavelength,  and  for  a  varying  from  0  (MFIE)  to  1  (EFIE).  Figure  4 
shows  that  in  order  to  drive  the  error  below  0. 1  dB,  Code  B’s  a  must  be  greater  than  0.9,  with  grid 
density  greater  than  14  nodes  per  wavelength.  Code  C  was  run  identically  (though  a  grid  density 
of  30  nodes  per  wavelength  was  too  large  a  problem  for  our  computer)  and  the  results  are  shown 
in  Figure  4.  This  shows  much  the  same  trend  as  for  Code  B,  albeit  the  error  was  more  well- 
behaved  as  a  function  of  grid  density  and  a.  The  lesson  to  be  learned  is  that  a  naive  CFIE  result 
may  not  be  accurate,  even  for  very  fine  grid  densities. 

An  estimate  of  the  uncertainty  in  the  calculated  trihedral  RCS  can  be  found  by  taking  the  maximal 
difference  among  multiple,  independently  computed  results.  Here,  we  assume  each  code  is  imple¬ 
mented  correctly  (i.e.,  validated)  and  each  individual  result  is  as  accurate  as  the  code  is  capable  of 
producing.  Using  Codes  A  and  C,  we  estimate  the  computed  RCS  uncertainty  to  be  0.02  dB 
across  the  2.875-3.525  GHz  band.  This  uncertainty  in  the  computed  RCS  of  the  trihedral  provides 
a  lower  bound  on  the  uncertainty  of  any  RCS  measurement  calibrated  by  the  trihedral. 


5.0  Conclusions 

We  have  discussed  the  RCS  test  process  from  the  viewpoint  of  controlling  measurement  uncer¬ 
tainty  using  advanced  CEM  tools.  We  have  shown  that  measurement  uncertainty  is  fundamentally 
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limited  by  uncertainty  in  the  theoretical  RCS  of  the  calibration  standard.  We  have  shown  that 
careful  application  of  first-principle  based  CEM  scattering  codes  can  reduce  this  lower  bound  to  a 
few  hundredths  of  a  dB  for  the  trihedral,  but  this  comes  at  the  cost  of  rigorous  analysis  of  the  code 
input  parameters  and  scattered  field  results.  Furthermore,  we  have  shown  that  several  well-known 
“rules  of  thumb”  for  moment-method  codes  may  fail  to  achieve  the  necessary  accuracy  under  cer¬ 
tain  circumstances. 
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ABSTRACT:  A  case  study  of  flat  plate  scattering  images  obtained  by  the  analytical  bistatic  k  space  and  experimental 
swept  frequency  ISAR  methods  is  presented.  The  key  advantage  of  the  bistatic  k  space  image  is  that  a  single  excitation  is 
required,  i.e.,  one  frequency  /  one  angle.  This  means  that  prediction  approaches  such  as  MOM  only  need  to  compute  one 
solution  at  a  single  frequency.  Bistatic  image  Fourier  transform  data  are  obtained  by  computing  the  scattered  field  at 
various  bistatic  positions  about  the  body  in  k  space.  Experimental  image  Fourier  transform  data  are  obtained  from  the 
measured  response  to  a  bandwidth  of  frequencies  over  a  target  rotation  range. 

This  paper  compares  both  image  approaches  as  applied  to  a  flat  plate.  Full  details  are  presented  in  [1]. 


Analytical  and  Experimental  Images:  Experimental  images  are  the  Fourier  transform  of  the  backscatter  reflected  field  as 
a  function  of  frequency  and  angle,  E{a>,  6) .  Analytical  bistatic  k  space  images  [1]  are  the  Fourier  transform  of  the 
bistatic  field  computed  from  a  current  distribution  J  as  a  function  of  down  and  cross  range  wave  number  k, 

E(kdouTt  >  &  cross')  •  This  quantity  is  computed  from  a  single  current  distribution  J(o)  from  the  generalized  radiation  integral 

JJ  (**  dS 

where  the  wave  vector  k  takes  on  down  and  cross  range  values  in  k  space, 

k=(Ak+k0)kdowI+Ak  kctoss . 

where  Ak  is  the  k-space  bandwidth  in  the  down  and  cross  range  directions  and  ko  =  2n/X  is  the  nominal  wave  number 
corresponding  to  the  excitation  frequency  to.  The  resolution  in  each  spatial  direction  is  Ar  =  2ji/Ak  which  can  be  recast  in 
terms  of  fractional  wave  number  space  (similar  to  fractional  frequency  bandwidth),  Ar  =  2V(Ak/ko).  Experimental 
resolution  for  comparison  is  Ar  =  (X/2)/(Af/f0)  for  downrange  and  Ar  =  (X/2)/(A0)  for  cross  range.  The  bistatic  k-space 
bandwidth  will  be  twice  the  experimental  bandwidth  for  the  same  resolution  Ar  due  to  the  bistatic  one  way  path  length 
change. 

Table  1  compares  various  image  issues  between  the  experimental  swept  frequency  ISAR  approach  and  the  analytical 
bistatic  k-space  approach. 

Experimental  and  Bistatic  k  space  image  comparison:  Due  to  space  limitations,  only  2-D  cases  are  presented.  The 
interested  reader  is  directed  to  [1]  for  the  1-D  cases.  The  target  is  a  6”  (0.15  m)  square  plate  viewed  from  three  aspect 
angles:  normal  to  the  plate;  edge  on  at  10°  elevation;  and  at  45°  azimuth  along  the  diagonal  at  10°  elevation.  The 
illumination  frequency  is  10  GHz  with  a  wavelength  of  1.18”  (0.03  m).  The  normalized  plate  dimensions  are  g/X  =  5.08 
and  A/X2  =  25.9  for  edge  length  and  plate  area  respectively. 

The  analytical  and  experimental  image  comparisons  each  used  a  Hanning  window  function.  The  experimental  data  used  a 
single  resolution  Ar  =  1.25  X.  The  analytical  images  used  three  resolutions,  which  were  0.4, 0.8,  and  1.2  times  the  Ar  of  the 
experimental  value.  This  was  done  to  demonstrate  how  resolution  affects  the  resulting  images. 
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The  experimental  resolution  is  1.25  Xc  =  1.47”  =  0.0375  m  corresponding  to  a  fractional  bandwidth  Af/f  of  0.4  (8-12GHz) 
and  an  ISAR  rotation  angle  of  20°.  The  bistatic  k  space  image  resolutions  were  1.5, 1.0,  and  0.5  X  corresponding  to  0.015, 
0.030,  and  0.045  m  respectively.  This  bracketed  the  experimental  value.  The  nominal  bistatic  viewing  angles  were  38°, 
57°,  and  1 15°  respectively. 

The  data  scales  between  the  experimental  and  computed  images  have  been  kept  the  same. 

Computational  Parameters:  The  square  plate  bistatic  k  space  images  were  computed  using  MOM3D  [3],  The  mesh 
model  had  5000  triangles  resulting  in  7400  unknowns.  The  geometry  symmetry  option  was  utilized  so  that  the  system 
matrix  was  decomposed  into  two  smaller  matrices  of  3725  and  3675  unknowns  each.  The  sample  density  was  286 
unknowns  /  X2 .  The  image  extent  was  set  to  0.5  m. 

Image  Comparisons:  View  1  is  perpendicular  to  the  plate.  Figure  1,  for  horizontal  polarization.  The  dominant  scattering 
for  this  case  is  specular  with  a  peak  RCS  of  0‘specular  =  4 7T-jr  =  7.6  m2  =  8.8  dBsm .  The  2-D  images  show 
backscattered  radiation  to  be  uniformly  coming  from  the  entire  plate.  Coarser  resolutions  distribute  energy  over  the  larger 
resolution  cell. 

The  experimental  image  shows  a  lower  level  source  emanating  from  behind  the  plate  probably  due  to  edge  to  edge 
diffraction  sometimes  called  talking  edges.  This  return  is  in  the  category  of  a  multiple  bounce  and  hence  images  later  in 
time  down  range  of  the  plate.  This  effect  is  not  seen  in  the  computed  images.  Residual  signal  processing  side  lobes  are 
seen  in  the  computed  2-D  images. 

View  2  is  normal  to  the  plate  edges  at  an  elevation  angle  of  10°.  The  dominant  scattering  is  due  to  edge  diffraction:  leading 
edge  for  horizontal  (parallel)  polarization.  Figure  2;  and  trailing  edge,  or  alternately  traveling  wave  reflection  for  vertical 
(perpendicular)  polarization,  Figure  3. 

The  H  pol  images  show  the  leading  edge  as  the  dominant  scattering  mechanism  along  with  two  edge  waves  reflecting  from 
the  rear  vertices.  The  leading  edge  return,  from  the  1-D  images,  is  close  to  the  theoretical  knife  edge  value  of 

credge  =  i2  In  —  0.0074m2  =  -21.3  dBsm .  The  experimental  image  also  shows  a  scattering  center  behind  the  plate 
due  to  rear  comer  to  comer  diffraction  that  arrives  later  in  time.  The  edge  wave  mechanism  in  the  computed  images  shows 
a  spatial  distribution  with  a  peak  at  the  rear  vertex.  This  corresponds  to  the  interpretation  that  the  edge  wave,  upon 
reflection,  loses  energy  via  radiation  as  it  travels  back  towards  the  front  comer.  Thus  we  see  in  the  image  a  decaying 
intensity.  The  coarser  resolution  computed  images  distribute  the  return  over  a  larger  spatial  extent. 

The  V  pol  images  show  the  trailing  edge  as  the  dominant  scattering  mechanism.  The  experimental  image  also  shows  a 
scattering  center  behind  the  plate  due  to  a  second  bounce  surface  wave,  i.e.,  from  the  rear  edge  to  the  front  and  a  second 
reflection  from  the  back  edge.  This  arrives  later  in  time  and  experimentally  images  behind  the  plate.  This  is  not  seen  in 
the  computed  images. 

The  experimental  V  pol  images  suggest  a  scattering  mechanism  from  the  front  edge  of  the  plate  which  in  not  observed  in 
the  computed  images.  We  believe  this  is  due  to  the  trailing  edge  reflected  surface  wave  and  that  if  the  plate  were 
electrically  longer  we  would  not  see  the  leading  edge.  The  computed  images  do  not  show  a  leading  edge  scattering  center. 
The  dominant  mechanism  for  this  viewing  angle  is  the  trailing  edge  traveling  wave  reflection.  In  the  computed  1-D  and  2- 
D  images  the  peak  occurs  at  the  rear  and  decays  towards  the  front  of  the  plate.  This  corresponds  to  the  interpretation  that 
the  traveling  wave,  upon  reflection,  loses  energy  via  radiation  as  it  travels  back  towards  the  front  edge.  Thus  the  image 
decays  in  intensity.  This  phenomenon  is  not  observed  in  the  experimental  data  because  the  swept  frequency  nature  of  the 
image  causes  the  stationary  phase  locations  to  coincide  with  the  edges  of  the  plate.  The  leading  edge  return  in  the 
experimental  images  is  interpreted  as  the  trailing  edge  reflected  surface  wave  that  has  not  completely  decayed  by  the  time 
it  reached  the  front.  This  happens  because  of  the  multiple  bounce  traveling  wave  arriving  later  in  time.  Because  the 
traveling  wave  mechanism  images  differently  for  the  computed  and  experimental  approaches,  the  peak  values  do  not 
correspond  to  each  other. 
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View  3  is  along  the  plate  diagonal,  azimuth  angle  of  45°,  at  an  elevation  angle  of  10°,  Figure  4  and  5  for  H  (parallel)  and  V 
(perpendicular)  polarizations  respectively.  The  dominant  mechanisms  are  non-specular  edge  and  traveling  waves. 

The  H  pol  images  show  scattering  from  the  four  tips  of  the  plate.  The  two  dominant  tips  are  the  mid  vertices  which  reflect 
the  edge  traveling  wave.  The  1.0  and  0.5  X  resolution  images  show  these  reflected  edge  waves  on  the  two  front  illuminated 
edges  with  the  peak  amplitudes  near  the  reflection  point.  The  rear  vertex  scattering  center  is  probably  due  to  edge  waves 
that  “turned  the  first  comer”  and  then  reflected  from  the  rear  most  vertex  and  /  or  possibly  due  to  tip  diffraction  since  the 
plate  is  viewed  at  an  elevation  of  10°.  The  front  tip  return  has  the  lower  magnitude  and  is  due  to  tip  diffraction. 

The  experimental  images  show  the  front  vertex  with  a  lower  amplitude  than  the  rear.  The  opposite  occurs  in  the  computed 
images.  The  reason  for  this  is  not  clear.  It  is  conjectured  that  die  measured  front  tip  image  is  the  phasor  sum  of  tip 
diffraction  and  the  remainder  of  the  reflected  edge  wave  which  are  phase  subtracting  to  produce  a  smaller  return. 

The  V  pol  images  show  scattering  from  the  rear  vertex  and  is  due  to  the  reflected  surface  traveling  wave.  As  before  with 
traveling  wave  mechanisms,  the  experimental  and  computed  image  magnitudes  differ.  The  computed  results  show  a  lower 
amplitude  spatially  distributed  return  while  the  experimental  image  shows  a  higher  amplitude  return  localized  to  the  rear 
vertex.  The  computed  images  show  the  maximum  amplitude  near  the  aft  reflection  point  with  the  reflected  traveling  wave 
decreasing  in  intensity  as  is  propagates  toward  the  front.  The  1-D  measured  image  shows  a  small  return  from  the  front  tip 
which  may  be  the  stationary  phase  residual  of  the  reflected  traveling  wave. 

The  view  angle  in  elevation  is  less  than  the  angle  for  maximum  surface  wave  reflection  which  would  occur,  using  the  plate 
diagonal  length,  at  about  18°. 

Image  comparison  summary: 

•  Scattering  mechanisms  that  are  localized  image  similarly.  The  experimental  and  computed  images  are  similar  in  most 
respects  for  specular  and  leading  edge  scattering.  For  trailing  edge  diffraction  we  suspect  similar  images;  however,  we 
have  not  made  this  comparison  for  a  large  X  target. 

•  Multiple  bounce  scattering  mechanisms  image  differently.  The  experimental  images  show  multiple  reflections  further 
down  range  since  this  is  a  time  delay  mechanism.  This  agrees  with  our  physical  view.  The  bistatic  k  space  approach 
images  the  current  distribution  that  by  definition  exists  only  on  the  surface  and  thus  can  not  be  further  down  range 
since  there  is  no  time  variation.  The  computed  images  show  multiple  bounce  mechanisms  as  locations  on  the  body 
where  those  current  mechanisms  exist,  i.e.,  placement  of  radar  absorbing  material  at  these  locations  would  reduce  the 
scattering. 

•  Surface  traveling  and  edge  waves  image  differently.  (And  we  suspect  for  creeping  wave  mechanisms,  although  we 
have  not  made  that  comparison)  The  computed  images  tend  to  show  edge  and  traveling  waves  as  distributed  sources 
over  the  scattering  surfaces/edges  which  corresponds  to  this  physical  mechanism.  The  swept  frequency  measured 
images  show  these  mechanisms  at  the  body  fore  and  aft  end  point  locations.  In  this  case  the  bistatic  images 
correspond  closer  to  a  physical  view. 

Summary:  A  bistatic  k  space  imaging  technique  has  been  introduced  which  is  applicable  for  many  computational 
electromagnetic  algorithms.  Images  are  a  useful  diagnostic  tool  in  that  they  provide  a  pictorial  representation  of  those 
body  currents  which  contribute  to  radiation  in  specified  spatial  directions.  By  knowing  how  the  geometric  current 
distribution  radiates  energy,  it  then  becomes  easier  to  modify  the  currents  and/or  geometry  to  produce  user  desired 
scattering  or  radiation  results. 

The  bistatic  k  space  image  algorithm  utilizes  a  computed  body  current  distribution  and  a  generalized  k  space  radiation 
integral  to  compute  the  scattered  field  in  k  space.  The  Fourier  transform  of  this  data  is  then  the  image  in  one,  two  or  three 
dimensions.  This  image  technique  can  be  applied  to  monostatic  or  bistatic  scattering  or  to  antennas  since  only  one  body 
excitation  is  required. 
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Bistatic  k  space  images: 

•  Bistatic  k  space  images  are  not  the  same  as  experimental  images.  Both  methods  produce  similar  results  particularly 
for  localized  scattering  mechanisms  such  as  specular  and  edge  diffraction.  Experimental  images  show  multiple 
bounce  scattering  centers  delayed  in  time.  Analytical  k  space  images  show  the  current  distribution  producing 
radiation.  Surface  traveling  and  edge  waves  image  as  distributed  sources  in  the  analytical  k  space  method  as  compared 
to  local  end  point  regions  in  experimental  images.  Experimental  images  typically  have  two  way  path  length  changes 
while  the  bistatic  k  space  images  have  a  one  way  path  length  change. 

•  The  method  may  be  applied  to  any  EM  computational  code  that  produces  a  current  distribution,  e.g.,  method  of 
moments,  physical  optics,  finite  element  frequency  domain,  etc.  The  computed  current  distribution  must  reside  on  the 
body,  i.e.,  equivalent  surface  currents  removed  from  the  geometry  will  not  image  back  to  a  physical  source. 

•  The  method  is  computationally  cheap.  It  is  equivalent  to  computing  a  bistatic  or  antenna  radiation  pattern  from  just 
one  current  excitation. 

•  Resolution  is  limited  by  the  discreteness  of  the  EM  analysis  and  by  user  desired  limits  on  the  bistatic  radiation  angles. 

•  Image  focusing  and  smearing  is  not  a  problem  because  the  bistatic  field  is  computed  on  a  uniform  grid  in  k  space. 
Experimental  data,  in  comparison,  is  usually  on  a  circular  arc  region  that  must  be  projected  to  a  uniform  rectangular 
grid  for  input  to  an  FFT. 

•  One,  two  or  three-dimensional  images  may  be  obtained  with  a  resolution  limit  of  7J4. 

•  The  method  may  be  used  to  produce  images  for  antennas  and  for  bistatic  scattering  or  for  monostatic  scattering. 

•  Images  may  be  made  for  arbitrary  polarization,  e.g.,  cross  polarized  images  may  be  obtained. 
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1 .0A,  resolution 
Analytical 


Experimental  Data 
1 .251  resolution 


1.5X  resolution 
Analytical 


0.5X  resolution 
Analytical 


1. OX  resolution 
Analytical 


E  Perpendicular  to  Plate 


Traveling  Wave  Scattering 


Transform  Domain 


Fourier  T  ransform  Variables 

et  Excitation  Currents 
(What  the  image  represents) 
Focus/Image  Smear 


Time  delayed  multiple  bounce 

1- D  Images 

2- D  Images 

3- D  Images  _ 

Fidelity  for  distributed  current 
radiation  (traveling  waves) 


Image  in  direction  other  than 
excitation,  i.e.,  bistatic  images 
Cross  polarized  image 


Measured  backscatter  fields 


Wave  number  k  and  position  r,  kr 

Only  at  frequency  and  illumination  angle  of 

excitation  _ 

None  since  image  is  computed  directly  in  k 
space  on  an  orthogonal  grid  of  uniformly 
spaced  points 

-112,  limited  only  by  granularity  of  current 
representation,  typically  «W10,  requires 

twice  the  experimental  bandwidth _ _ 

Images  currents  which  radiate  in  direction  of 

image  _ _ _ _ 

Down  range  (radial)  in  k  space 
add  cross  range  in  k  space 
add  perpendicular  direction  in  k  space 
Images  the  radiation  as  a  distributed  source^ 


Frequency  and  time,  ot  for  down  range 

and  krfl  for  cross  range  _ 

Varies  over  the  excitation  frequency  and 

illumination  angle _ _ _ 

Rotation  angle  limited  to  small  angle 
without  using  focusing  algorithms. 
Inherently  a  circular  region  in  transform 

space _ _ _ 

Set  by  available  frequency  bandwidth  an 
small  rotation  angle  approximation 

Images  downrange  as  time  delay,  hence 

see  delayed  in  time _ _ _ 

Frequency  Sweep 
add  rotation  angle 

add  measurements  over  orthogonal  plan 
Because  of  frequency  sweep,  images  the 
end  points  of  distributed  source 


Straight  forward  application 
Straight  forward  application 


Very  difficult,  seldom  done 

If  separate  transmit  and  receive  feeds, 
rotate  one  feed  _ 


Table  I  Image  Issues  for  Bistatic  k-Space  and  Experimental 
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A  Hybrid  Periodic  Moment  Method  Formulation  for  Scattering 
from  Large  2-D  Wire  Arrays 

Peter  J.  Collins  and  J.  Paul  Skinner 
Air  Force  Institute  of  Technology 


Abstract 

We  develop  a  hybrid  method  of  moments  numerical  model  for  the  electromagnetic  scattering  from  large 
finite  planar  wire  arrays  The  method  incorporates  the  novel  concept  of  a  periodic  physical  basis  function 
to  dramatically  reduce  the  required  number  of  unknowns.  The  model  can  represent  an  arbitrary  number 
of  infinite  length  wires  surrounded  by  a  coplan ar  dielectric  slab. 


1  Introduction 

Periodic  screens  have  historically  held  great  interest  to  the  scientific  and  engineering  communities.  Often 
referred  to  as  “frequency  selective  surfaces”  for  their  frequency  filtering  properties,  they  find  a  wide  range 
of  application  from  optical  spectroscopy  to  microwave  antennas.  Their  practical  usefulness  has  motivated  a 
tremendous  effort  in  developing  numerical  analysis  methods  for  design  purposes. 

One  common  approach  approximates  the  finite  periodic  structure  with  an  infinite  structure  containing  the 
same  periodicity.  An  integral  equation  connecting  the  scattering  currents  to  the  incident  field  is  reduced  to  a 
system  of  equations  using  the  moment  method.  Floquet  theory  then  reduces  the  infinite  dimension  problem  to 
a  finite  one  over  a  single  reference  element.  Papers  by  Munk  et.al.  at  the  Ohio  State  University[l]  and  Mittra 
et.al.  at  the  University  of  IUinois[2]  are  excellent  examples  of  this  approach,  commonly  called  the  periodic 
moment  method. 

While  useful  in  many  applications,  the  periodic  moment  method  can  not  account  for  the  edge  effects 
present  in  all  real  geometries.  In  some  applications,  the  edge  effects  play  a  critical  role  in  the  electromagnetic 
properties,  motivating  recent  efforts  in  modeling  finite  arrays.  The  standard  moment  method,  while  capturing 
the  edge  effects,  is  often  limited  by  current  computer  resources  to  small  array  geometries.  This  shortcoming 
has  lead  some  authors  to  explore  hybrid  approaches. 

In  their  1988  article,  Cwik  and  Mittra  investigated  truncated  and  curved  strip  arrays  by  approximating  the 
induced  current  on  each  array  element  as  if  it  were  a  member  of  an  infinite  array  analogous  to  the  physical  optics 
approximation^].  The  authors  then  replace  the  edge  element  currents  by  those  calculated  for  a  much  smaller 
array,  combining  the  fields  radiated  by  these  currents  through  the  standard  radiation  integral.  Felsen  and 
Carin  have  also  investigated  finite  arrays  using  a  novel  ray  field-FIoquet  mode  approach  which  provides  insight 
into  the  time  domain  scattering  from  truncated  periodic  structures  [4]  [5].  Both  of  these  techniques  provide 
a  reasonable  first  order  approximation  to  the  effects  caused  by  truncating  periodic  structures.  However, 
since  neither  technique  accurately  accounts  for  the  coupling  between  the  edge  elements  and  those  in  the  array 
interior,  they  provide  poor  results  for  cases  where  edge  effects  predominate. 

In  the  following  paragraphs,  we  present  a  new  approach  to  modeling  the  coupling  between  the  edge  elements 
and  the  interior  elements  in  a  large  array  based  on  the  concept  of  a  “physical  basis  function.”  Although  couched 
in  different  terms,  the  physical  basis  function  concept  was  originally  proposed  by  Morita[6],  and  later  Tew  and 
Tsai[7]  in  an  effort  to  apply  the  moment  method  to  infinite  planar  structures.  Burnside,  Yu,  and  Marhefka  then 
applied  it  to  the  wedge  diffraction  problem,  combining  the  geometrical  theory  of  diffraction  with  the  moment 
method[8].  To  our  knowledge,  however,  the  current  work  represents  the  first  application  of  the  physical  basis 
function  concept  to  periodic  structures. 
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Figure  1:  2-D  Wire  Array  Equivalent  Geometry 


2  Formulation 

The  core  of  the  physical  basis  function  concept  involves  a  fairly  straight-forward  application  of  a  priori  knowl¬ 
edge  of  the  induced  current  behavior  to  the  moment  method.  Consider  the  currents  impressed  on  a  large 
planar  array  by  a  plane  wave  illumination  (large  implying  many  wavelengths).  Starting  from  an  edge  and 
moving  toward  the  center  of  the  array,  there  will  be  some  point  where  the  current  amplitudes  become  approx¬ 
imately  constant  over  a  large  central  section  of  the  finite  array.  Over  this  portion  one  can  use  the  periodic 
moment  method  to  determine  the  representative  admittance  or  impedance  of  these  elements.  This  physical 
basis  function  can  then  be  used  along  with  the  edge  element  basis  functions  to  create  a  coupling  matrix  which 
relates  the  unknown  current  expansion  coefficients  to  the  known  incident  field.  The  coupling  matrix  explicitly 
incorporates  the  influence  of  the  “infinite”  portion  of  a  large  finite  array  into  the  small  finite  array  analysis 
in  a  manner  analogous  to  how  the  interconnection  matrix  of  diakoptic  theory  combines  the  impedances  of 
individual  structural  elements  into  a  composite  structure  matrix[9].  This  technique  represents  a  departure 
from  traditional  hybrid  methods  characterized  by  the  Cwik  and  Mittra  “physical  optics”  approach. 

To  illustrate  the  method,  consider  the  2-D  wire  array  geometry  shown  in  Figure  1  illuminated  by  a  TMz 
plane  wave.  Assuming  the  number  of  wires  in  the  array,  N,  is  large  enough  such  that  the  two  array  edges 
are  uncoupled,  one  can  break  the  large  finite  array  problem  into  two  semi-infinite  array  problems.  Once 
the  induced  currents  are  found  for  each  semi-infinite  problem,  the  first  f  currents  from  each  problem  can 
be  combined  with  the  appropriate  phase  shift  in  the  radiation  integral  to  obtain  the  desired  scattered  fields. 
While  the  presence  of  the  dielectric  slab  could  be  accounted  for  by  a  non-free  space  Green’s  function,  we 
choose  to  develop  expressions  in  the  plane  wave  spectral  domain  where  the  planar  boundaries  are  accounted 
for  by  simple  Fresnel  coefficients. 

We  start  the  analysis  by  considering  the  semi-infinite  wire  array  in  a  homogeneous  media  with  the  material 
parameters  of  the  dielectric  slab.  Using  the  surface  equivalence  theorem,  we  define  a  new  geometry  where 
impressed  currents  radiate  into  the  homogeneous  media.  The  equivalent  problem  contains  an  infinite  number 
of  unknown  currents  maVing  a  numerical  solution  via  a  traditional  moment  method  approach  impossible.  In 
order  to  make  the  problem  tractable,  we  assume  the  current  amplitudes  vary  significantly  only  over  the  first 
B  elements.  The  remaining  elements  have  approximately  equal  amplitude  currents  which  differ  by  a  Floquet 
phasing  factor  dependent  on  the  incident  plane  wave  direction.  This  assumption  reduces  the  problem  to  that 
of  finding  ( B  +  1)  unknowns. 

We  next  form  an  integral  equation  relating  the  unknown  currents  to  the  known  incident  field.  As  is  common 
in  these  formulations,  the  total  tangential  electric  field  boundary  condition  is  used  to  obtain  an  integral  equation 
of  the  form 

Ei(p)  =  "X  //  OT-p'l)  p  e  Swires  (1) 
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where  sWires  denotes  the  wire  surfaces.  We  solve  Equation  (1)  by  expanding  the  unknown  currents  with 
appropriate  known  basis  functions.  In  this  case,  B  impulse  basis  functions  and  one  physical  basis  function  are 
required.  These  take  the  form 
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where  pq  —  qdxx.  Note  that  pq=0  locates  the  “edge”  of  the  physical  basis  function  and  s  denotes  the  incident 
plane  wave  propagation  direction.  With  this  choice,  we  are  effectively  treating  the  edge  wire  basis  functions  as 
perturbations  of  the  physical  basis  function.  Inserting  Equation  (2)  in  Equation  (1),  we  obtain  the  expression 
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Following  Harrington’s  moment  method  formulation,  we  define  an  inner  product  and  force  a  chosen  set  of 
(J5  +  1)  testing  functions  to  be  orthogonal  to  the  error  function[10].  For  the  wire  geometry,  the  filamentary 
current  nature  suggests  a  Dirac  delta  testing  function.  In  order  to  avoid  the  Green’s  function  singularity, 
we  place  the  test  location  a  wire  radius  away  from  the  basis  function.  We  also  define  the  testing  function 
associated  with  the  physical  basis  function  to  be  nonzero  only  over  a  chosen  reference  wire  element  to  keep  the 
coupling  matrix  elements  bounded. 


8t  =  6(p-pt),  t  =  0,l,...B  (4) 

where 
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Performing  the  inner  product,  we  arrive  at  the  following  matrix  equation 

(j0f .  t  =  0,l, ...B  (5) 

V  g=0  6=1  / 

which  can  be  solved  for  the  unknown  expansion  coefficients,  J&.  Once  the  currents  are  known  for  the  two 
semi-infinite  array  problems,  we  can  combine  the  first  y  currents  from  each  in  the  radiation  integral  to  obtain 
the  desired  scattered  fields. 

In  order  to  account  for  the  dielectric  slab,  we  need  to  convert  the  previous  spatial  domain  expressions 
to  the  spectral  domain.  The  spectral  domain  form  also  generally  converges  faster  than  the  spatial  domain 
summation.  Consider  the  first  term  in  Equation  (5).  This  expresses  the  fields  radiated  by  the  physical  basis 
function  in  terms  of  a  collection  of  cylindrical  waves.  Using  the  one-sided  Poisson  sum  formula  described  in 
our  earlier  paperfll],  we  transform  the  cylindrical  wave  expansion  to  a  plane  wave  expansion  of  the  form 


The  function  denoted  by  T  is  the  “T-factor”  introduced  by  Munk  which  accounts  for  the  multiple  reflections 
of  the  plane  wave  radiated  by  the  physical  basis  function.  For  a  detailed  description,  see  Munk’s  work  on 
periodic  surfaces  in  stratified  media[12][l]. 
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The  remaining  terms  in  Equation  (5)  express  the  fields  radiated  by  the  edge  wires.  To  obtain  a  plane  wave 
expansion,  we  use  the  “Array  Scanning  Method”  first  introduced  by  Munk  and  Burrell[13j.  The  basic  ASM 
procedure  starts  with  the  spatial  domain  field  from  an  edge  element 


Etr  ( Pt )  =  -P„ |) 


(7) 


We  then  create  a  “pseudo-array”  around  the  edge  element  and  express  the  fields  in  the  spectral  domain. 


e-j0[Pi  -pa3m  -Px’m  (b-l)dx] 
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where 


After  multiplying  by  the  “T-factor”  to  account  for  the  dielectric  slab,  we  recognize  the  resulting  sum  as  a 
Fourier  series  whose  central  term  is  the  fields  radiated  by  the  original  edge  element  in  the  presence  of  the 
dielectric  slab.  We  extract  the  central  term  using  an  appropriately  defined  inner  product  to  obtain 
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Combining  Equations  (6)  and  (9),  we  have  a  spectral  domain  equivalent  to  Equation  (5)  in  the  presence  of 
the  dielectric  slab. 

El  (pt)  *  EFBF  (pt)  +  £  ETe  C Pt ),  t  =  0, 1, . . .  B  (10) 
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Equations  (5)  and  (10)  form  the  core  of  the  wire  array  codes  we  use  to  generate  the  results  in  the  next  section. 


3  Results 

To  demonstrate  the  accuracy  of  our  formulation,  we  present  the  bistatic  echowidth  of  a  50  wire  array  in  free 
space  illuminated  60°  off-normal.  The  wires  are  placed  0.1A  apart.  The  plot  in  Figure  2  contains  three  traces; 
the  standard  moment  method  calculation  containing  50  unknowns,  the  “physical  optics”  approximation  where 
we’ve  used  10  unknowns  on  each  edge  and  one  unknown  for  the  center  30  wires,  and  the  subject  hybrid  method 
using  the  same  number  of  unknowns  as  the  “physical  optics”  approach.  Clearly  the  “physical  optics”  approach 
does  not  completely  capture  the  scattering  mechanisms.  To  see  why,  we  examine  the  actual  induced  scattering 
currents  calculated  by  each  method. 

Figure  3  contains  the  induced  current  magnitudes.  According  to  the  figure  the  “physical  optics  currents 
next  to  the  inner  array  wires  are  in  error.  The  explanation  lies  in  the  fact  these  currents  were  computed  in  free 
space  instead  of  next  to  an  “infinite”  wire  array.  Our  hybrid  method  more  accurately  predicts  the  currents 
precisely  because  this  coupling  is  inherently  included  in  the  coupling  matrix. 

As  a  final  demonstration,  we  encapsulate  the  previous  array  in  a  0.5A  thick  dielectric  slab  with  a  relative 
permittivity  of  2.0.  Figure  4  shows  the  hybrid  method’s  ability  to  include  planar  boundaries.  The  induced 
current  magnitudes  for  a  20°  off-normal  incident  angle  axe  presented.  While  the  hybrid  method  accurately 
predicts  the  edge  current  behavior,  the  slab  increases  the  coupling  between  wires  indicating  more  edge  elements 
may  be  required. 
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Figure  3:  Induced  Current  Magnitude  on  Wire  Array 
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Figure  4:  Induced  Current  Magnitudes  on  Wire  Array  in  Dielectric  Slab 

4  Conclusion 

In  the  previous  paragraphs,  we  developed  a  hybrid  periodic  moment  method  solution  for  the  electromagnetic 
scattering  from  large  finite  planar  wire  arrays.  Based  on  the  concept  of  a  periodic  physical  basis  function,  the 
method  explicitly  incorporates  the  coupling  between  the  edge  wires  and  the  central  portion  of  the  array  in  a 
moment  method  coupling  matrix.  In  addition  to  providing  a  more  accurate  solution  when  compared  to  previous 
hybrid  techniques,  the  method  provides  a  computational  advantage  over  standard  moment  method  solutions 
by  effectively  removing  the  link  between  the  array  size  and  the  required  number  of  unknowns.  This  efficiency 
is  demonstrated  in  a  paper  currently  submitted  to  the  IEEE  transactions  on  Antennas  and  Propagation  where 
we  apply  the  method  to  more  general  slotted  radome  geometries. 
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1.  Introduction 

The  accuracy  of  numerical  methods  for  solution  of  scattering  problems  is  in  general  poorly  under¬ 
stood  [1].  Codes  are  validated  by  comparison  with  test  cases  which  are  often  much  simpler  than 
real-world  scatterers.  With  any  numerical  method,  the  most  important  questions  are  “How  long 
does  it  take  to  get  a  solution?”  and  “How  accurate  is  that  solution?”  A  theoretical  understanding 
of  the  convergence  of  numerical  scattering  methods  which  answers  these  questions  would  be  of 
great  value  to  practitioners  of  computational  electromagnetics. 

A  convergence  theory  consists  of  two  parts:  a  bound  on  the  solution  error  and  an  estimate  of 
the  condition  number  of  the  method,  in  terms  of  properties  of  the  scatterer  and  the  discretization 
scheme.  Such  a  theory  exists  for  the  boundary  element  method  applied  to  Laplace’s  equation 
[2].  This  work  has  been  extended  to  the  dynamic  case  [3,  4]  but  the  resulting  bounds  apply  only 
asymptotically  in  the  limit  as  the  discretization  length  vanishes.  For  large  scatterers,  computa¬ 
tional  costs  can  be  excessive  even  at  the  lowest  possible  discretization  densities.  Residual-based 
bounds  are  also  available  [5,  6],  but  in  order  to  estimate  discretization  error,  residuals  must  be 
computed  using  a  finer  discretization  than  that  used  to  solve  the  problem.  Global  phenomena 
such  as  resonance  also  reduce  the  effectiveness  of  both  of  these  types  of  bounds. 

To  overcome  these  difficulties,  a  convergence  theory  for  computational  electromagnetics  must 
take  into  account  not  only  the  local  nature  of  a  problem  but  also  global  properties  as  well. 
For  integral  equation  methods,  this  requires  a  knowledge  of  the  spectrum  of  the  operator.  For 
separable  geometries  such  as  the  cylinder  and  the  sphere,  the  spectrum  is  known,  but  the  spectrum 
is  unavailable  in  closed  form  for  more  complex  scatterers.  In  this  paper,  we  give  the  approximate 
spectrum  of  the  EFIE  for  an  infinite  conducting  strip  and  a  cavity,  and  use  this  to  obtain  the 
condition  number  of  the  EFIE  for  these  scatterers.  By  comparing  the  continuous  and  discrete 
operators,  we  also  obtain  the  spectral  error  introduced  by  the  discretization. 


2.  TM  Polarization 

By  transforming  the  EFIE  for  a  TM-poIarized  field  incident  on  a  perfectly  conducting  strip  to 
a  spectral  basis,  the  operator  becomes  diagonally  dominant,  and  the  spectrum  of  the  EFIE  is 
approximated  by  the  self-interaction  terms.  The  self-interaction  of  modes  of  the  form  e1  oPx  is 

=m  In  (ip  +  y/l  -  P2^j 
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where  0  =  p/L,p  =  0,  ±1,  ±2, . . .,  and  L  is  the  length  of  the  strip  in  wavelengths.  The  first  term 
is  the  limiting  value  for  an  infinite  plane,  and  higher  order  terms  represent  the  effect  of  scattering 
by  the  edges  of  the  strip.  Equation  (1)  breaks  down  for  \0\  —  1,  and  is  replaced  by 
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for  the  self  interaction  of  the  surface  wave.  Figure  1  compares  the  spectrum  of  the  impedance 
matrix  for  a  strip  of  width  20  A  to  the  values  of  AP  given  by  Eqs.  (1)  and  (2). 
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Figure  1:  Spectrum  of  Z  for  strip,  L  —  20,  nx  —  10.  Pluses:  computed.  Circles:  theoretical,  Eqs.  (1),  (2),  and  (6).  Squares: 
numerical  evaluation  of  Ap. 

Discretizing  the  EFIE  with  a  finite  basis  leads  to  a  cutoff  of  the  spectrum  near  the  accumulation 
point  at  the  origin.  The  eigenvalue  of  the  moment  matrix  Z  with  the  smallest  magnitude  is 
Amin  —  —ir)/nx  where  nx  =  X/h  is  the  discretization  density  or  number  of  points  per  wavelength. 
The  corresponding  mode  has  the  maximum  spatial  frequency  representable  in  the  finite  basis,  and 
its  self-coupling  is  the  most  strongly  localized.  The  largest  eigenvalue  is  Amax  ~ 
and  corresponds  to  the  surface  wave  mode  with  a  spatial  frequency  of  k0.  Due  to  long  range 
coupling  by  fields  propagating  parallel  to  the  strip,  the  surface  wave  is  antiresonant.  A  strong 
incident  field  is  required  to  excite  the  mode,  and  its  eigenvalue  (which  represents  the  energy  stored 
by  the  mode  for  a  unit  surface  current)  grows  with  the  strip  length.  The  condition  number  of  Z 
can  be  approximated  by  the  pseudo-condition  number  |Amax|/jAmin|,  which  is 

k(Z)  ~  (2/3)t ixL'V.  (3) 

If  a  near-neighbor  preconditioner  is  employed,  the  growth  in  iteration  count  with  nx  can  be 
mitigated,  since  the  preconditioner  accurately  models  the  localized  interaction  between  the  high 
frequency  eigenfunctions.  Since  the  preconditioner  does  not  capture  the  long  range  interactions 
which  determine  the  largest  eigenvalues,  however,  the  condition  number  will  still  scale  as  iV1/2  as 
the  size  of  the  scatterer  increases. 

Discretization  also  shifts  Ap  by  a  small  amount  A Ap  which  approximates  the  discretization  error 
in  the  spectrum  of  Z.  Evaluating  the  interaction  of  the  modes  elk°^Xn  using  the  moment  matrix  Z 
and  comparing  to  the  continuous  limit  yields  the  relative  spectral  error  Eb{P,nx,a)  =  A Ap/Ap. 
b  is  the  sum  of  the  orders  of  the  testing  and  expansion  functions,  where  the  delta  function  basis 
(point  matching)  is  order  0,  a  piecewise  constant  basis  (pulse  function)  is  order  1,  and  a  piecewise 
linear  basis  (triangle  function)  is  order  2.  The  parameter  a  specifies  the  relative  shift  ah/2  of  the 
locations  of  the  testing  and  expansion  functions.  There  are  two  error  components:  (i)  sampling 
error,  due  to  aliasing  of  high  frequency  components  of  the  singular  kernel  by  the  finite  local  basis, 
and  (ii)  smoothing  error,  caused  by  inaccurate  interpolation  of  low  frequency  components. 

We  first  assume  exact  integration  of  the  moment  matrix  elements.  For  small  P/nx,  it  can  be 
shown  that 

Eo  ~  2iln  [2sin(7ra/2)]%1;  E\  ~  -i/3g(a)nx2  -  (7r2/?2/6)%2;  Eb  ~  -(ir2bp2/S)nx2,  b  >  1  (4) 
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where  g(a)  =  Li2(-eiwa)  -  Li2(-e~i1TQ)  and  Li  is  the  polylogarithm  function.  E0  consists  only 
of  sampling  error;  the  two  terms  of  E1  are  sampling  and  smoothing  errors,  respectively;  and 
smoothing  error  dominates  for  &  >  1.  The  divergences  of  Eb  correspond  to  point  matching  at 
locations  where  the  derivative  of  the  expansion  function  is  undefined.  E0  has  a  minimum  at 
a  =  1/3,  where  the  leading  spectral  error  term  given  in  Eq.  (4)  vanishes.  Figure  2  shows  the 
relative  spectral  error  over  0  <  0  <  nx/2  for  nx  =  10.  For  the  b  =  0  discretization,  the  expansion 
function  is  taken  to  be  S(x  +  hf 6)/2  +  5(x  -  h/6)/ 2  and  the  testing  function  is  5 (x),  m  which 
case  the  spectral  error  is  0(n'2).  Figure  3  compares  the  error  of  the  numerical  solution  to  the 
EFIE  for  a  cylinder  for  various  values  of  b.  As  predicted  by  the  theory,  the  error  for  the  b  -  0 
discretization  is  smallest,  and  increases  with  the  order  of  the  discretization.  Since  E0  depends 
strongly  on  the  value  of  a,  this  discretization  scheme  is  more  sensitive  to  irregular  discretization 
than  the  higher  order  bases. 


Figure  2:  Relative  spectral  error,  TM  polarization,  nx  =  10. 
Solid  line:  b  =  0  (point  discretization,  a  =  1/3).  Dashed 
line:  b  =  1  (point  matching,  piecewise  constant  basis).  Dot¬ 
ted  line:  b  =  2  (point  matching,  piecewise  linear  basis). 


Figure  3:  Relative  current  error  for  cylinder,  for  =  2n,  TM 
plane  wave  incident  at  <j>  =  0.  Solid  line:  b  =  0  (point  dis¬ 
cretization,  a  =  1/3).  Dashed  line:  b  =  1  (point  matching, 
piecewise  constant  basis).  Dotted  line:  b  —  2  (point  match¬ 
ing,  piecewise  linear  basis). 


The  use  of  numerical  quadrature  to  evaluate  moment  matrix  elements  leads  to  an  additional 
sampling  error  component.  For  the  simplest  M- point  first  order  integration  rule,  the  error  is 

£J1;M~z21n2(MnA)"1.  (5) 


Unless  otherwise  specified,  the  numerical  results  in  this  paper  for  the  TM  polarization  employ  a 
single  point  integration  rule  with  analytical  self  term,  for  which  the  quadrature  error  is 

E'll  ~  i2\n(Tr/e)nxl.  (®) 

For  b  >  1,  the  spectral  error  is  of  the  same  order  as  E1>M-  With  any  standard  quadrature  rule  such 
as  the  trapezoidal  rule  and  its  generalizations  or  Gaussian  quadrature  with  unity  weight  function, 
the  order  of  the  spectral  error  in  Mnx  is  independent  of  the  order  of  the  quadrature  rule.  It  is 
well  known  that  in  order  to  reduce  error,  the  integration  rule  must  take  into  account  the  form  of 
the  singularity  of  the  kernel  [7]. 
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3.  TE  Polarization 

The  domain  of  the  EFIE  for  the  TE  polarization  is  the  Sobolev  space  H1/2,  which  contains  the 
square  integrable  functions  vanishing  at  the  edges  of  the  strip  [4].  Transforming  the  TE  EFIE  to 
a  spectral  basis  for  this  space  leads  to 


VV 1  ~  P2  _  J7_ 

2  2tt2 


In  (ip  +  y/l  -  P2) 


Py/T^W 


L  *  +  0(L  2),  L  — ^  oo 


(7) 


where  p  =  p/(2L)  and  p  —  1,2,....  The  self-interaction  of  the  surface  wave  is 

A2L  ~  [vV2/(27r)](l  +  i)L~1/2  -  (ivf^)L-1  +  0(Zr3/2),  L oo.  (8) 


Figure  4  compares  the  spectrum  for  a  strip  of  length  10  A  with  Ap.  In  constrast  to  the  TM  case,  the 
largest  eigenvalue  arises  from  the  highest  frequency  mode,  and  the  surface  wave  mode  is  resonant, 
so  that  its  eigenvalue  decreases  as  the  strip  length  increases.  From  Eqs.  (7)  and  (8),  the  condition 
number  of  Z  is 

k  ~  (tt/4 )nxL1/2  (9) 

which  scales  with  the  electrical  size  in  the  same  way  as  in  the  TM  case. 


Figure  4:  Spectrum  of  TE  EFIE  for  strip,  L  —  10,  b  =  1  discretization,  n\  =  10.  Pluses:  computed.  Circles:  theoretical, 
Eqs.  (7),  (8)  and  (10).  Squares:  numerical  computation  of  Ap. 

For  the  first  few  values  of  b,  the  discretization  error  for  the  TE  polarization  is 

E0  ~  esc2  (7ra/2),  El  ~  /?tan  (ira/2)  -  E2  ^  In  [2 sin  (7ra/2)]  -  (10) 

for  small  p/nx.  Figure  5  shows  the  spectral  error  for  several  different  discretizations.  If  a  =  0, 
the  leading  term  of  E\  vanishes.  In  this  case,  the  error  is  0(n ^2),  and  is  smaller  than  Eb  for 
b  >  1.  Figure  6  compares  the  surface  current  for  a  plane  wave  incident  on  a  cylinder  of  radius 
k0r  =  1/(27t)  obtained  using  pulse  basis  functions  and  point  matching  (6  =  1)  with  the  exact 
solution.  In  the  hypersingular  term  of  the  TE  EFIE,  the  derivative  of  the  pulse  basis  function 
leads  to  delta  functions  at  the  edges  of  the  pulse.  If  these  delta  functions  are  located  off  the 
cylinder  at  the  vertices  of  a  polygon,  the  results  of  Peterson,  et  al  [8]  are  recovered,  and  the 
point/pulse  result  is  much  poorer  than  pulse/triangle  (6  =  3).  If  the  delta  functions  are  moved 
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onto  the  cylinder,  the  current  error  norm  decreases  by  a  factor  of  16,  and  the  accuracy  is  roughly 
the  same  as  the  pulse/triangle  discretization.  For  a  strip,  the  6  =  1  discretization  is  more  accurate. 
Thus,  the  point/pulse  discretization  is  not  inherently  less  accurate  than  higher  order  schemes,  but 
is  more  sensitive  to  geometrical  modeling  errors  and  irregular  discretization.  _ 
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Figure  5:  Relative  spectral  error,  TE,  nx  =  10.  Solid  line;  b  Figure  6:  Surface  current  for  cylinder,  k0r  =  1/(20,  TE, 
=  1  (delta/pulse),  a  =  0.  Dashed  line:  b  =  2  (delta/triangle),  plane  wave  incident  at  0®.  Solid  line:  Mie  series.  Dotted 
a  =  1/3.  Dotted  line:  b  =  3  (pulse/triangle),  a  =  0.  line:  b  =  1,  delta  functions  off  cylinder.  Dashed  line:  b  =  1, 

delta  functions  on  cylinder. 

For  the  6  =  0  and  6  =  1  discretizations,  the  hypersingular  term  of  the  EFIE  can  be  evaluated 
analytically,  and  the  remaining  term  has  the  same  kernel  as  the  TM  EFIE,  so  that  the  spectral 
error  due  to  quadrature  for  the  TE  polarization  is  the  same  as  Eq.  (5).  For  6  >  1,  the  singularity 
of  the  hypersingular  kernel  can  be  reduced  by  partial  integration,  and  the  quadrature  error  also 
becomes  identical  to  the  TM  case. 


4.  Cavity 

For  two  parallel  strips  separated  by  a  distance  of  W  X,  the  spectrum  of  the  EFIE  consists  of  sums 
and  differences  of  the  self-  and  cross-coupling  of  modes  on  each  strip.  If  2 W  is  an  integer,  the 
mode  with  the  smallest  difference  eigenvalue  is  cos  (7 rx/L) .  The  eigenvalue  is  Ar  ~  i(rj7r/8)WL  + 
(7?\/2/16)W3/2Zr3.  This  leads  to  the  condition  number  estimate 


for  the  TM  polarization.  For  the  TE  case,  k  —  2nxL? /{kW),  so  that  the  growth  with  L  is  not  as 
fast,  but  discretization  length  continues  to  affect  conditioning. 

The  spectral  error  A Ap  makes  an  additional  contribution  to  the  resonant  eigenvalue.  If  the 
basis  is  such  that  quadrature  makes  the  only  contribution  to  sampling  error,  the  relative  spectral 


error  is 


EbM  —  1.8 


L2 

WMnx 


(12) 


The  growth  with  L  implies  poor  convergence  of  the  numerical  solution  for  the  cavity,  which  has 
been  observed  experimentally  [9].  For  a  cavity  such  that  L  =  15  and  W  =  4,  Eq.  (12)  leads 
to  the  condition  Mnx  »  101,  so  that  the  usual  ten  points  per  wavelength  discretization  density 
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combined  with  a  low  order  integration  rule  fails  to  model  the  resonant  behavior  of  the  cavity.  Since 
smoothing  error  affects  both  the  self-  and  cross-coupling  of  the  resonant  mode,  its  effect  on  the 
difference  eigenvalue  is  less  significant  than  that  of  sampling  error.  If  sampling  error  is  eliminated 
by  the  use  of  an  accurate  quadrature  rule  for  the  kernel  of  the  EFIE,  the  relative  spectral  error 
becomes  independent  of  the  cavity  size. 

The  EFIE  also  becomes  ill-conditioned  at  non-resonant  frequencies,  for  which  2 W  is  not  an 
integer.  In  this  case,  the  mode  with  the  smallest  difference  eigenvalue  has  spatial  frequency 
/?  =  y/a/W,  where  a  —  2 W  —  [2W\ .  The  real  part  of  the  eigenvalue  for  this  mode  is  Re{Ap}  ~ 
rjy/oiW /(2L).  The  real  part  of  Ap  approximates  the  magnitude  of  Ap,  so  that  for  the  TM  polar¬ 
ization, 


4  L3/2 
SVaW' 


(13) 


For  the  TE  polarization,  k  ~  nxL / (2VaW) .  Figure  7  shows  the  condition  number  of  Z  for 
L  =  10  as  a  function  of  width  W.  The  decrease  in  condition  number  as  W  increases  is  due  to 
the  tilting  of  the  direction  of  propagation  of  the  fields  radiated  by  the  mode  closest  to  resonance. 
As  the  direction  moves  away  from  normal,  radiation  loss  increases,  the  eigenvalue  grows,  and  the 
condition  number  decreases. 


w 


Figure  7:  Condition  number  of  Z  for  parallel  strips,  L  —  10. 
Pluses:  computed  value.  Dashed  line:  theory,  off-resonance, 
Eq.  (13).  Circles:  theory,  at  resonance. 


Figure  8:  k(Z),  parallel  strips.  Circles:  computed,  at  reso¬ 
nance  ( W  =  2).  Pluses:  computed,  off-resonance  (W  =  2.2). 
Solid  line:  theory,  at  resonance,  nx  =  oo.  Dashed  line: 
theory,  at  resonance,  nx  =  10.  Dash-dot  line:  theory,  off- 
resonance.  (Predicted  curves  scaled  by  a  factor  of  1.5.) 


Figure  8  shows  the  condition  number  as  a  function  of  L  at  resonance  (W  =  2)  and  off-resonance 
(W  =  2.2).  At  resonance,  the  smallest  eigenvalue  approaches  the  origin  along  the  imaginary  axis. 
Since  sampling  error  is  imaginary,  the  resonant  eigenvalue  eventually  stops  moving  towards  the 
origin  as  the  cavity  size  increases.  Away  from  resonance,  the  smallest  eigenvalues  are  closely 
spaced  on  a  circle  which  approaches  the  origin  along  the  real  axis,  so  that  sampling  error  does 
not  keep  them  away  from  the  origin.  Thus,  the  condition  number  of  the  impedance  matrix  grows 
faster  with  problem  size  for  a  non-resonant  frequency  for  large  L  than  at  resonance. 

In  addition  to  ill-conditioning,  resonant  modes  also  cause  a  posteriori  residual-based  error 
bounds  [5,  6]  to  break  down.  These  bounds  rely  on  the  closeness  of  fractional  order  Sobolev 
norms  to  the  inverses  of  the  TE  and  TM  EFIEs.  For  resonance-regime  structures,  long  range 
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coupling  produces  eigenvalues  which  depend  on  the  scatterer  size  and  cause  these  norms  to  differ 
significantly  from  the  operator  inverses,  so  that  the  Sobolev  norm  of  the  residual  no  longer  provides 
a  good  estimate  of  the  solution  error.  The  effect  is  most  severe  for  the  cavity  at  resonance,  but 
also  occurs  for  the  cavity  away  from  resonance  and  for  the  strip  as  well.  These  eigenvalues  also 
cause  the  residual  itself  to  become  a  poorer  indicator  of  solution  convergence. 

5.  Conclusion 

We  have  given  the  condition  number  scaling  for  a  conducting  strip  and  resonant  cavity  for  the 
TM  and  TE  polarizations,  and  identified  three  sources  of  ill-conditioning:  increasing  discretization 
density,  surface  wave  self-coupling,  and  resonance.  Changing  to  a  second-kind  integral  equation 
eliminates  the  first  two  sources,  but  real  (not  internal)  resonances  still  cause  ill-conditioning. 
Near-neighbor  preconditioners  reduce  the  growth  rate  with  discretization  density,  but  do  not 
eliminate  the  growth  with  scatterer  size. 

We  have  also  determined  the  spectral  error  introduced  by  discretization  for  piecewise  polyno¬ 
mial  bases.  The  results  show  that  error  can  be  smaller  for  simpler  discretization  schemes,  but 
there  is  a  trade-off  between  the  complexity  of  the  discretization  and  the  sensitivity  to  irregu¬ 
lar  discretization  and  geometrical  modeling  error.  In  addition,  with  a  proper  choice  of  basis  and 
quadrature  rule,  the  spectral  error  is  independent  of  problem  size  for  both  the  strip  and  the  cavity. 
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Abstract 

The  hp— edge  finite  elements  for  steady-state  Maxwell’s  equations  are  studied.  Numer¬ 
ical  evidence  of  exponential  convergence  for  certain  hp- discretizations  is  presented. 

The  hp-FEM  for  Maxwell’s  equations.  This  note  provides  a  brief  update  of  our  work 
on  the  hp-adaptive  Finite  Element  Method  for  steady-state  Maxwell’s  equations.  In  the  hp- 
methods,  we  can  vary  locally  element  size  h  and  order  of  approximation  p.  The  discretization 
proposed  in  [4,  13]  generalizes  Nedelec’s  rectangular  elements  of  the  first  type  [9]  and  his 
triangular  elements  of  the  second  type  [10].  Our  two-dimensional  implementation  of  the 
hp-FEM  with  hierarchical  shape  functions,  2Dhp9Q-EM ,  utilizes  the  idea  of  constrained 
approximation  [11],  An  example  of  an  hp- mesh  is  given  in  Fig.  1.  Elements  vary  in  size  and 
different  shades  correspond  to  different  orders  of  approximation.  The  method  developed  in 
[4,  13]  guarantees  stability  and  convergence  of  the  approximation  [4,  8].  As  an  illustration, 
Fig.  2  presents  the  solution  of  the  ’two  cylinders’  problem  [7]  on  the  hp- mesh  of  Fig.  1.  The 
solution  is  oscillation- free  in  the  entire  range  of  frequencies  u,  including  u  — ►  0. 


Figure  1:  Example  of  an  hp- mesh 


Figure  2:  Two  cylinders  problem:  Ex 
component 


There  are  at  least  two  arguments  in  favor  of  using  hp-discretizations  for  practical  compu¬ 
tations.  The  first  argument  stems  from  the  work  of  Ihlenburg  and  Babuska  [6]  who  studied 
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how  order  of  approximation  p  affects  pollution  error  (related  to  phase  error)  in  FE  simu 
lations.  The  authors  showed  that  as  the  order  of  approximation  increases,  the  pollution 
error  significantly  decreases.  This  means  that  higher  order  elements  not  only  lower  the  in¬ 
terpolation  error  of  the  numerical  solution  but  also  stabilize  Galerkin  discretization.  Thus, 
if  possible,  the  higher  order  elements  should  be  preferred  in  computations.  Obviously,  high 
p  leads  to  elements  of  large  size.  In  order  to  adequately  approximate  fine  scale  geometry, 
smaller  size  elements  of  low  order  are  often  needed.  It  is  for  these  problems  the  co-existence 
of  small  elements  of  low  order  with  the  large  elements  of  high  order  in  the  same  mesh  becomes 
crucial. 

The  second  and  more  frequently  used  argument  relies  on  the  ability  of  Ap-methods  to 
deliver  exponential  rates  of  convergence.  In  the  actual  computations,  an  a-posteriori  error 
estimation  [2]  is  used  to  produce  the  Ap-mesh  which  minimizes  the  error  for  a  prespecified 
number  of  degrees  of  freedom  or,  equivalently,  minimizes  the  problem  size  within  a  prescribed 
error  tolerance. 

The  superiority  of  the  Ap-adaptive  discretizations  over  classical  uniform  meshes  and  A- 
adaptive  or  p-adaptive  methods  is  usually  demonstrated  by  showing  that  only  an  optimal 
distribution  of  p  and  A  can  guarantee  the  exponential  convergence  for  a  wide  class  of  prob¬ 
lems,  including  those  with  irregular  and  singular  solutions  [12].  In  this  note  we  would  like 
to  present  two  numerical  examples  illustrating  the  possibility  of  such  a  convergence. 


Exponential  convergence  result  for  a  regular  solution:  Oblique  wave  problem. 

We  solve  the  steady-state  Maxwell’s  equations  in  a  square  domain  ft  =  (0,  l)2,  with  Dirichlet 

boundary  conditions.  The  boundary  data  and  the  impressed  current  correspond  to  the  exact 

solution:  . 

Ex  =  —kn2  sin (kn  • x )  Ey  —  kni  sm(kn  ■  x) 

k  —  u)  =  3.0,  n  =  (ni,  n2)  =  (0.6, 0.8) 

The  n  vector  intersects  with  computational  meshes  at  an  oblique  angle.  For  regular  solutions, 
the  pmethod  alone  should  deliver  exponential  convergence.  We  discretize  the  domain  with 
a  mesh  of  6  x  6  elements,  using  uniform  p-refinements  with  p  =  1,2,3, 4.  The  convergence 
results  are  summarized  in  Fig.  3-4. 

The  error  is  measured  in  L2-norm  and  #(cur/)-seminorm.  The  convergence  rates  for 
the  prefinements  are  compared  with  (algebraic)  convergence  rates  corresponding  to  uniform 
A-refinements. 

Exponential  convergence  result  for  a  singular  solution:  Diffraction  problem.  We 
consider  the  diffraction  of  a  plane  wave  on  a  semi-infinite  conducting  screen.  The  problem  is 
set  up  in  a  square  domain  ft  =  (-1,  l)2  shown  in  Fig.  5,  with  Dirichlet  boundary  conditions 
that  correspond  to  the  exact  solution  of  the  diffraction  problem  in  the  entire  plane. 
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Figure  3:  Smooth  problem:  exponen-  Figure  4:  exponential  convergence  in 

tial  convergence  in  L2-norm  for  the  p-  H(curl)- seminorm  for  the  p-method 

method 


{f  [(2*r)i  sin  i(*  -  *~)]  +  F  [(2*r)»  sin  | (<*  +  #”]  } 


0mC  =  - 
^  2 


F(u)  —  |e~^  ~ 

<?(*)- j£(z)  :=  I*  e~*"jt2  dt - 
Jo 


Fresnel  integrals 


Figure  5:  Diffraction  problem  Figure  6:  Error  in  the  imaginary  part 

of  Ex  near  the  singularity 

Our  initial  uniform  mesh  consists  of  8  x  8  linear  elements.  To  reach  exponential  rates  of 
convergence,  we  discretize  the  domain  with  geometrically  graded  meshes  using  a  Babuska- 
type  strategy.  Away  from  the  singularity,  we  conduct  uniform  p-refinements,  whereas  near  it 
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we  perform  up  to  9  levels  of  /i-refinements.  The  details  on  the  construction  of  adaptive  meshes 
will  be  presented  elsewhere.  Fig.  7  and  Fig.  8  demonstrate  the  evidence  of  exponential  rate 
of  convergence  for  our  Ap-adaptive  meshes.  For  comparison,  the  error  plots  for  ^-adaptive 
meshes  of  linear  elements  are  also  presented.  It  is  evident  from  the  plots  that  the  /i-adaptivity 
may  provide  only  algebraic  rates  of  convergence. 


Figure  7:  Screen  problem:  exponen-  Figure  8:  exponential  convergence  in 

tial  convergence  in  L2-norm  for  the  hp-  /7-(cuW)-seminorm  for  the  hp- method 

method 


A  sample  solution  on  a  representative  intermediate  mesh  is  presented  in  Fig.  9  -  12. 

To  conclude,  we  present  in  Fig.  6  the  distribution  of  the  point-wise  error  in  the  imaginary 
part  of  the  Ex  component.  The  plot  shows  the  error  in  the  vicinity  within  .0039  units  from 
the  origin.  As  expected,  most  of  the  error  is  localized  right  around  the  singular  point. 


Figure  9:  Screen  problem:  real 
part  of  Ex  component 


Figure  10:  Imaginary  part  of  Ex 
component 
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Figure  11:  Screen  problem:  real 
part  of  Ey  component 


Figure  12:  Imaginary  part  of 
component 
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Abstract 

Multimedia  technology  provides  a  valuable  resource  to  boost  interest  in  engineering  education  and 
enhance  the  ability  to  teach  and  to  leam  from  engineering  courses.  The  ability  to  combine  visualization 
of  complex  mathematical  and  abstract  subjects,  gain  experience  with  virtual  laboratories,  develop  guided 
use  of  simulation  software,  and  even  experience  virtual  participation  in  practical  applications  are  among 
the  many  advantages  of  technology-based  education.  In  this  paper  we  describe  and  demonstrate  the 
main  features  of  four  multimedia  CD-ROMs  developed  by  the  CAEME  Center  for  Multimedia 
Education  and  Technology.  This  includes  the  Physics  Museum,  Calculus  Castle,  the  Genetics  CD- 
ROM,  and  the  most  recent  CD-ROM  on  Engineering  Electromagnetics.  In  addition  to  describing  and 
demonstrating  the  features  of  the  developed  multimedia  CD-ROMs,  results  from  statistical  studies 
showing  benefits  from  multimedia  and  technology-based  education  will  be  discussed.  Preliminary 
results  from  controlled  educational  experiments  show  that  students  leam  by  an  average  of  20%  more 
when  exposed  to  multimedia  components  in  their  learning  and  studying  of  their  technical  subject. 

Introduction 

The  use  of  simulation  software  as  part  of  engineering  education  is  a  common  practice,  and  its 
advantages  are  well  known.  Simulation  software  allows  students  to  experiment  with  phenomena  and 
practice  with  designs  which  are  too  complex  to  calculate  or  expensive  to  reproduce  in  a  laboratory. 
Furthermore,  many  simulation  software  packages  are  extensively  used  in  industry  and  it  is  important 
that  students  practice  and  be  familiarized  with  the  use  of  software  to  simulate  and  design  engineering 
problems. 

The  CAEME  Center  for  Multimedia  Education  and  Technology  at  the  University  of  Utah  has  been 
developing  software  packages  for  engineering  education  with  focus  on  "electromagnetics"  since  1990. 
The  first  product  from  the  Center  was  a  compilation  of  16  simulation  software  packages  developed  by 
engineering  schools  from  universities  across  the  USA  [1].  The  published  book  and  associated  software 
were  well  received,  but  many  instructors  reports  limited  use  because  of  the  extensive  effort  needed  to 
integrate  the  software  into  established  and  traditional  structures  of  academic  courses.  Instructors 
suggested  that  a  stand-alone  product  which  requires  minimum  intervention  by  teachers  would  be  more 
effective  in  helping  the  integration  of  software  in  regular  classroom  teaching  and  would  even  be  more 
useful  to  students  in  their  individual  studies. 

In  response  to  these  suggestions,  the  CAEME  Center  included  several  tutorials  in  its  second 
software  book  in  electromagnetics  [2,  3].  To  date,  the  Center  focuses  its  activities  on  the  development 
of  fully  interactive  multimedia  products  which  include  virtual  laboratories,  functional  virtual 
instruments,  animation  and  visualization  of  dynamic  phenomena,  and  virtual  experiences  with  practical 
applications  in  addition  to  the  guided  use  of  simulation  software.  It  is  important  to  note  that  the 
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development  of  multimedia  products  is  expensive  and  time  consuming.  Therefore,  it  is  necessary  that 
focus  be  placed  on  the  development  of  subject  and  concepts  that  can  benefit  most  from  the  multimedia 
technology.  Emphasis  should  be  placed  on  enhancing  interactivity  to  help  students  simulation, 
experiment,  manipulate  and  use  instruments,  and  even  experience  some  aspects  of  practical  applications. 
It  is  well  known  that  these  are  the  difficult-to-prepare  multimedia  components,  but  they  are  the  elements 
known  to  enhance  learning  and  increase  benefits  of  multimedia-  and  technology-based  education. 

In  the  following  sections,  the  procedure  for  implementing  the  interactive  multimedia  components 
in  the  CD-ROM  products  developed  by  the  CAEME  Center  will  be  described  in  more  detail. 

Brief  Description  of  the  Developed  Multimedia  CD-ROMs 

In  addition  to  the  two  software  books  on  electromagnetics,  CAEME  developed  four  interactive 
multimedia  CD-ROMs  in  calculus,  physics,  genetics,  and  engineering  electromagnetics.  The  following 
is  a  brief  description  of  each  of  these  CD-ROMs: 

a.  The  Calculus  Castle 

This  multimedia  CD-ROM  presents  users  with  an  animated  three-dimensional  castle  housing  eight 
complete  calculus  lessons.  By  selecting  a  topic  in  calculus  from  the  main  menu,  users  move  through  the 
castle  into  rooms  which  lead  to  lessons  on  the  selected  topics.  Examples  of  topics  include  polar 
functions,  limits,  area  under  curves,  rules  of  differentiation,  particle  motion,  and  definite  integrals.  In 
all,  over  eight  hours  of  discussion  are  included  with  emphases  placed  on  visualization,  interactive  Q&A 
sessions,  and  most  importantly,  experiences  with  applications  of  topics  in  calculus.  Once  inside  the 
lessons,  users  learn  by  watching  and  interacting  with  animated  graphics,  listening  to  narration,  running 
external  software,  and  taking  quizzes  to  evaluate  their  understanding. 

b.  The  Physics  Museum 

The  Physics  Museum  provides  an  environment  of  learning  basic  physics  concepts  by  conducting 
experiments  in  virtual  laboratories.  As  the  user  enters  the  museum,  he  faces  three  virtual  elevators 
which  lead  to  various  laboratories  in  physics.  The  first  elevator  leads  to  lessons  on  vectors  and 
coordinate  systems,  and  mechanics.  The  second  elevator  leads  to  lessons  on  circuits,  electrostatics,  and 
principles  of  wave  propagation;  while  the  third  elevator  has  topics  on  optics,  acoustics,  and  the  Frantic 
Physics  action  game.  All  lessons  integrate  two-  and  three-dimensional  graphics,  sound,  video,  and 
narration,  and  interactive  multimedia  virtual  laboratories.  Examples  of  the  developed  virtual  labs 
include  Coulomb's  experimental  apparatus,  experimental  verification  of  Snell's  law,  electric  circuits 
laboratory,  and  several  experiments  illustrating  acoustical  concepts  such  as  Doppler  shift  and 
interference  of  sound  waves. 

c.  The  Genetics  CD-ROM 

This  multimedia  CD-ROM  focuses  on  the  natural  history  of  human  genetics.  Topics  include  the 
cell  structure,  Mandellian  Genetics,  DNA  extraction  experiments,  Meioses  and  Mitoses,  and  an 
interactive  criminal  investigation  demonstrating  the  application  of  DNA  tests  in  criminal  investigations. 
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d.  Engineering  Electromagnetics 

The  developed  CD-ROM  contains  lessons  that  constitute  a  complete  introductory  course  in 
electromagnetics.  This  includes  electrostatics,  magnetostatics,  Maxwell’s  equations,  electromagnetic 
waves  dielectric  materials  and  their  applications,  transmission  lines,  antennas,  and  an  extensive  package 
of  simulation  software  including  2D  FDTD,  MoM,  transmission  line  solution  on  the  Smith  Chart,  and  a 
2D  Finite  Element  code  written  in  LabVIEW.  It  is  interesting  to  note  that  even  in  introducing  a  highly 
mathematical  topic  like  Maxwell’s  equations,  dependence  on  equations  and  text  was  minimized  and 
instead  users  were  taken  back  to  the  laboratory  of  the  pioneers  who  experimentally  developed  these 
equations  and  were  presented  with  virtual  apparatus  and  allowed  to  conduct  the  same  experiments  which 
lead  to  the  development  of  these  equations.  Every  effort  was  made  to  describe  and  emphasize  practical 
applications  of  electromagnetic  fields  not  by  listing  and/or  describing  these  applications  but  by  allowing 
the  user  to  experience  these  applications  in  virtual  environments. 

Enhanced  Interactivity 

As  mentioned  in  earlier  discussion,  it  is  important  to  include  as  much  student  interaction  with  the 
multimedia  application  as  possible  to  help  maintain  interest,  increase  effectiveness  of  learning,  improve 
retention  and  enhance  exploration.  It  is  believed  that  interactivity  is  the  cornerstone  for  a  good 
multimedia  educational  product  and  effective  multimedia  tutoring.  In  a  multimedia  product, 
interactivity  may  be  enhanced  by  including  the  following: 


•  simulation  software 

•  virtual  laboratories  ,  .  »  uimiw 

•  use  of  intelligent  virtual  instruments  such  as  those  that  may  be  developed  using  LabVIEW 

•  using  gaming  strategy  ,  r  , 

•  by  providing  the  student  with  the  ability  to  navigate  through  the  lesson  and  the  freedom  to 

tailor  it  to  his  needs. 

It  should  be  noted  that  these  benefits  do  not  come  without  cost.  The  more  complex  and  detailed 
multimedia  environments  are,  the  more  taxing  on  the  underlying  system.  Applications  that  contain 
extensive  video,  narration,  sound  effects,  graphics,  and  extensive  navigation  capabilities  can  become 
slow,  choppy  on  less-capable  computers,  and  present  excessive  demands  on  memory  size  and  hardware 
capabilities.  This  does  not  mean  that  these  elements  should  be  avoided,  but  with  proper  planning  and 
preparation,  these  concerns  can  be  greatly  alleviated. 

Figures  1  and  2  show  some  results  from  simulation  software  packages  included  in  the 
electromagnetics  CD-ROM.  The  first  demonstrates  real-time  results  from  the  2D  FDTD  code,  while  the 
other  shows  results  from  the  Smith  chart  transmission  line  simulation  software  package. 

Figures  3  and  4  show  screen  captures  illustrating  some  of  the  developed  virtual  laboratories. 
Figure  3  shows  the  Biot-Savart  experiment  virtual  laboratory,  while  Fig.  4  shows  the  transmission  line 
impedance  measurements  virtual  lab. 

Figure  5  shows  the  Time  Domain  Reflectometer  virtual  instrument  developed  using  LabVIEW, 
while  Fig.  6  shows  the  navigation  toolbar  used  in  all  the  CAEME  CD-ROMs.  Figure  7  shows  an 


214 


example  of  a  virtual  experience  with  a  practical  application,  the  microwave  method  for  lung  water  meas¬ 
urements. 

These,  as  well  as  other  features  of  the  interactive  multimedia  CD-ROM  products  developed  by  the 
CAEME  Center,  will  be  described  and  demonstrated  in  the  presentation.  In  addition,  comments  will  be 
made  and  statistical  data  will  be  presented  regarding  the  effectiveness  and  the  educational  value  of  com¬ 
puter-  and  multimedia-based  instruction. 
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Figure  1.  A  screen  caputured  during  a  simulation  performed 
with  the  2D  FDTD  software  package. 


Figure  2.  A  screen  from  the  Electronmagnetics  CD- 
ROM  illustrating  the  use  of  the  Smith  Chart. 


Figure  3.  Biot-Savart  experiment  virtual  laboratory.  Figure  4.  Transmission  line  impedance  measurement  vir¬ 
tual  laboratory. 
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Figure  6  .  The  toolbar  used  for  navigation  in  all  the  CAEME  CD-ROMS. 


lung  water  content. 
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1  Introduction 

Electromagnetic  modeling  of  microstrip  structures,  such  as  antennas,  circuits  and  interconnects, 
becomes  more  and  more  essential  comparing  with  the  traditional  quasi-static  methods.  For 
this  purpose,  a  variety  of  methods  have  been  developed.  A  review  of  them  reveals  that  the 
method  of  moments  (MoM)  solution  of  the  integral  equation  has  received  intense  attention.  In 
this  method,  the  evaluation  of  Green’s  functions  and  the  choice  of  basis  functions  are  crucial  to 
obtain  an  accurate  and  efficent  solution.  Therefore,  many  variations  of  this  method  have  been 
implemented.  Among  them,  the  solution  based  on  the  mixed  potential  integral  equation  (MPIE) 
using  the  Rao-Wilton-Glisson  (RWG)  basis  function  [1]  is  an  attractive  approach  [2]— [4],  because 
the  RWG  basis  function  offers  more  flexibility  to  model  arbitrarily  shaped  structures,  and  MPIE 
provides  less  singular  kernel  comparing  with  the  electrical  field  integral  equation  (EFIE). 

On  the  other  hand,  when  microstrip  structures  become  more  and  more  complicated,  multi¬ 
layer  media  are  employed  to  allow  for  more  versatile  designs  [4],  [5].  This  fact  necessitates  the 
modeling  to  be  capable  of  dealing  with  microstrip  structures  in  multilayer  media.  To  achieve  this 
goal,  the  derivation  of  Green’s  functions  for  multilayer  media  is  crucial.  As  we  know,  the  spatial 
domain  Green’s  function  is  the  inverse  Hankel  transform  of  the  spectral  domain  counterpart, 
which  is  commonly  known  as  the  Sommerfeld  integral  (SI).  Generally  speaking,  the  analytical 
solution  of  the  SI  is  not  available,  and  the  numerical  integration  is  time-consuming  since  the 
integrand  is  both  highly  oscillating  and  slowly  decaying.  Several  efficient  techniques  have  been 
developed  to  speed  up  this  numerical  integration,  such  as  the  method  of  averages  [6]  and  the 
asymptotic  extraction  [7].  To  further  accelarate  the  computation,  the  scheme  that  precomputes 
the  integrals  and  uses  table  lookup  and  interpolation  techniques  is  popularly  used  [4].  Another 
approach  to  rapidly  evaluate  the  SI  is  the  discrete  complex  image  method  (DCIM)  [8]— [IX].  The 
basic  idea  of  the  DCIM  is  to  approximate  the  spectral  kernal  by  a  sum  of  complex  exponen¬ 
tials  using  the  Prony  method  or  the  generalized  pencil-of-function  (GPOF)  method  [12].  Then, 
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the  Si’s  can  be  evaluated  in  closed  forms  via  the  Sommerfeld  identity.  This  method  has  been 
extensively  employed  to  analyze  microstrip  structures.  However,  most  of  the  work  is  confined 
to  single-layered  or  double-layered  structures  because  of  the  difficult  treatment  of  surface-wave 
contribution  for  multilayer  media. 

This  paper  first  presents  the  MPIE  formulation  for  a  general  microstrip  structure  in  multilayer 
media  and  the  MoM  solution  to  this  equation.  Then,  we  derive  the  spectral  domain  Green’s 
functions  for  multilayer  media.  The  DCIM  is  employed  to  efficiently  evaluate  the  SI.  It  shows 
that  the  extraction  of  surface-wave  contribution  is  not  necessary,  which  makes  the  DCIM  very 
suitable  for  multilayer  media.  Therefore,  the  spatial  domain  Green’s  functions  can  be  represented 
in  closed  forms.  Combination  of  the  Green’s  functions  and  the  RWG  basis  functions  offers  an 
accurate  and  efficient  modeling  of  microstrip  structures  in  multilayer  media.  Numerical  results 
are  presented  and  a  good  agreement  with  measured  or  published  data  is  observed. 

2  Formulations 

Consider  a  general  multilayer  medium,  as  shown  in  Fig.  1.  For  layer  i ,  the  permittivity  and 
permeability  are  denoted  as  ez  and  fa,  respectively.  In  most  cases,  the  bottom  is  backed  by  a 
ground  plane  and  the  top  is  open.  The  medium  is  assumed  to  be  laterally  infinite.  In  this  section, 
the  MPIE  formulation  for  general  microstrip  structures  is  derived.  To  obtain  the  spatial  domain 
Green’s  functions,  we  first  derive  analytically  the  Green’s  functions  in  the  spectral  domain.  Then, 
the  DCIM  is  employed  to  convert  the  spectral  domain  Green’s  functions  into  the  spatial  domain, 
which  can  be  written  as  closed  forms. 

2.1  Mixed  potential  integral  equation 

The  microstrip  structures  are  assumed  to  be  of  infinitesimal  conductor  thickness  and  located  in 
the  p-plane.  With  excitation  by  an  applied  field  E°,  the  induced  current  on  the  microstrips  can 
be  found  by  solving  the  following  MPIE 

j(jz  x  [A(r)  +  ^V$(r)]  =h  Ea(r)  (1) 

where  the  vector  and  scalar  potentials  can  be  expressed  as 

A(r)  =  £Gx(r,r')-J(r  ')*'  (2) 

<H*)  =  ffsG,(r,r')VJ(t')ds'  (3) 

in  which  GA  and  Gq  denote  the  Green’s  functions  for  the  vector  and  scalar  potentials,  respectively. 

To  solve  the  integral  equation  (1)  by  the  MoM,  we  first  divide  the  microstrips  into  triangular 
elements  and  then  expand  the  current  on  the  microstrips  using  the  RWG  basis  function  [1], 
Applying  Galerkin’s  method  to  (1)  results  in  a  matrix  equation.  The  current  on  the  microstrips 
can  be  obtained  by  solving  the  matrix  equation. 

The  fast  evaluation  of  the  Green’s  functions  is  critical  for  an  efficient  analysis.  However,  the 
spatial  domain  Green’s  functions  is  represented  by  the  SI 

Gafay)  =  ^J^Gaykl,,zy)i^\kp\p-pi\)kedkf  (4) 
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where  Ga  is  the  xx-component  of  and  Ga>q{kp,z,z')  is  the  spectral  domain  counterparts  of 
Gai9(r,r'),  which  is  time-consuming  to  evaluate.  In  the  next  two  subsections,  we  will  give  the 
general  form  of  spectral  domain  Green’s  functions  and  the  efficient  evaluation  of  Si’s  by  the 
DCIM. 

2.2  Spectral  domain  Green’s  functions 

The  basic  principle  to  obtain  the  spectral  domain  Green’s  functions  due  to  an  arbitrary  source 
embedded  in  a  multilayer  medium  is  to  first  decompose  the  fields  into  TE  and  TM  waves.  These 
TE  and  TM  components  are  then  expanded  in  terms  of  the  plane  waves  and  their  propagations  in 
multilayer  media  can  be  characterized  by  the  generalized  reflection  and  transmittion  coefficient. 
For  planar  microstrip  configurations,  we  consider  a  horizontal  electric  dipole  (HED)  embedded 
in  layer  m.  The  field  point  is  located  in  layer  n.  The  spectral  domain  Green’s  functions  for  scalar 
and  vector  potentials  can  be  written  as 

(5> 


where  F£E  and  F™  are  the  TE  and  TM  components  in  layer  n,  respectively,  which  can  be 
derived  as  in  [13]. 


2.3  Discrete  complex  image  method 

Once  the  spectral  domain  Green’s  functions  are  obtained,  the  next  step  is  to  evaluate  the  SI, 
which  is  very  time-consuming  because  of  the  highly  oscillating  and  slowly  decaying  behavior  of  the 
integrand.  Although  some  techniques  have  been  proposed  to  fast  evaluate  the  SI,  the  numerical 
integration  is  still  the  bottleneck  to  design  a  fast  algorithm.  The  DCIM  obviates  numerical 
integration  and  represents  the  SI  in  a  closed  form.  Initially,  the  DCIM  first  extracts  the  quasi¬ 
static  and  surface-wave  contributions  from  the  spectral  domain  kernel  and  then  approximates 
the  remainding  kernel  by  a  sum  of  complex  exponentials  and  finally  evaluates  the  integral  via 
the  Sommerfeld  identity.  Because  the  extraction  of  surface-wave  contributions  for  multilayer 
media  is  quite  cumbersome,  it  is  difficult  to  extend  the  DCIM  to  multilayer  media.  The  GPOF 
method  has  been  applied  to  cast  the  Green’s  functions  into  closed  forms,  which  is  more  robust 
and  less  noise  sensitive  compared  to  the  original  Prony  method.  The  use  of  GPOF  also  shows 
that  the  extraction  of  surface- wave  contributions  is  not  necessary,  which  makes  it  easy  to  employ 
the  DCIM  for  the  multilayer  media. 

Rewrite  the  spectral  domain  Green’s  function  in  a  simple  form  as 

(7) 

As  the  first  step  of  the  DCIM,  the  quasi-static  contributions,  which  dominate  as  kp  oo,  are 
extracted  first,  which  makes  the  remaining  kernel  decay  to  zero  for  sufficently  large  kp.  Therefore, 
we  can  rewrite  (7)  as 

G  =  lzlk  +  -^.  (8) 
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where  F0  denotes  the  quasi-static  term.  With  the  aid  of  the  Sommerfeld  identity,  the  quasi-static 
contribution  from  the  second  term  in  (8)  can  be  evaluated  as 


G0  =  F0 


e~jkmP 

Anp 


The  first  term  in  (8)  can  be  approximated  as  a  sum  of  complex  exponentials  using  the  GPOF 
method.  The  deformed  integration  path  in  the  complex  kmz  is  chosen  as  kmz  =  km  [(1  -  t/T0)  -  jt ], 
where  To  is  determined  such  that  F— To  becomes  negligible  beyond  Tq.  Uniformly  sampling  F— Fo 
on  t  G  [0,  T0]  and  applying  the  GPOF  method,  we  can  approximate  F  -  F0  as 


F-FQ  =  '£aie-k^ 

»=i 

Again  using  the  Sommerfeld  identity,  we  obtain 

g -jkmp  Nc  e-jkmTi 


G  —  F0- 
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(10) 


(11) 


where  r*  —  \j p2  -bf  is  complex,  which  is  the  location  of  the  complex  image,  and  Nc  is  the 
number  of  complex  images. 

As  mentioned  in  [10],  when  the  source  is  in  the  bounded  region  of  multilayer  media,  a 
modification  of  the  DCIM  is  necessary.  When  the  source  is  in  the  bounded  layer,  the  exponentials 
are  written  in  terms  of  the  propagation  coefficient  of  the  unbounded  layer  instead  of  that  of  the 
source  layer. 


3  Numerical  Results 

First,  we  verify  the  spatial  domain  Green’s  functions  obtained  by  the  DCIM.  Consider  the  three 
layer  medium  [11],  The  first  layer  is  the  free  space.  The  second  layer  is  Teflon  with  relative 
permittivity  er  =  2.1  and  thickness  0.7  mm.  The  third  layer  is  GaAs  with  relative  permittivity 
er  =  12.5  and  thickness  0.3  mm,  which  is  backed  by  a  ground  plane.  The  frequency  is  30  GHz. 
There  are  four  types  of  Green’s  functions  we  seek.  The  HED  can  be  at  the  interface  of  the  first 
and  the  second  layer  m  =  1  or  the  second  and  the  third  layer  m  =  2.  The  field  point  also  has 
two  choices,  n  =  1  or  n  =  2.  The  Green’s  functions  of  all  these  four  types  are  given  in  Fig.  2 
compared  to  those  obtained  by  the  numerical  integration  along  the  Sommerfeld  integration  path 
(SIP).  A  good  agreement  can  be  observed.  The  computation  time  for  the  DCIM  is  negligible 
compared  to  the  numerical  integration.  In  this  calculation,  we  choose  T0  =  20  and  Nc  —  5. 

After  the  validation  of  this  technique,  we  incorporate  the  DCIM  into  the  MoM  solution 
of  MPIE  for  microstrip  structures  in  multilayer  media.  We  first  consider  a  proximity-coupled 
circular  patch  antenna.  The  two  substrate  layers  have  the  same  relative  permittivity  er  =  2.62 
and  thickness  1.59  mm.  The  microstrip  line  is  on  the  bottom  layer  and  the  patch  is  on  the  top 
layer.  The  radius  of  the  patch  is  17.5  mm  and  the  width  of  the  microstrip  line  is  4.372  mm. 
The  overlapping  length  may  be  used  to  control  the  coupling.  Here  it  is  17.5  mm.  The  input 
impedance  with  the  reference  plane  79.0  mm  away  from  the  patch  center  is  given  in  Fig.  3,  which 
agrees  well  with  the  measured  data  from  [16]. 
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We  take  a  two-port  asymmetric  antenna  as  another  example  [17].  The  two  dipoles  on  the 
top  layer  have  the  different  lengths  so  that  the  dual  frequency  operation  is  achieved.  The  two 
substrate  layers  have  the  same  relative  permittivity  er  =  2.17.  The  top  layer  has  thickness  1.6 
mm  and  the  bottom  layer  has  thickness  0.8  mm.  On  the  top  layer  there  are  two  orthogonally 
crossed  dipoles.  The  transversal  one  has  length  11.9  mm  and  the  longitudinal  one  has  length 
10.2  mm.  The  width  of  them  is  1.7  mm.  On  the  bottom  layer,  the  feeding  line  is  2.2  mm  wide. 
The  transversal  one  is  located  at  a  distance  of  2.97  mm  from  the  center  of  crossed  dipoles  and 
the  longitudinal  one  is  3.4  mm  away.  Fed  from  the  port  on  the  left,  the  transversal  dipole  is 
excited  at  the  resonant  frequency  8.4  GHz.  The  current  distribution  can  be  seen  from  Fig.  4(a). 
Fed  from  the  port  on  the  bottom,  the  longitudinal  dipole  is  excited  at  its  resonant  frequency  9.6 
GHz.  The  current  is  shown  in  Fig.  4(b).  At  11.2  GHz,  both  dipoles  are  resonant.  The  incident 
power  is  essentially  transmitted  from  one  port  to  the  other  port.  The  magnitude  of  S-parameters 
is  given  in  Fig.  4(c).  The  measured  data  are  available  from  [17].  The  good  agreement  can  be 
observed. 

4  Conclusion 

This  paper  gives  an  efficient  MoM  solution  of  the  MPIE  formulation  for  a  general  microstrip 
structure  in  multilayer  media.  In  this  method,  the  spectral  domain  Green’s  functions  for  multi¬ 
layer  media  are  derived  and  then  converted  to  the  spatial  domain  by  the  DCIM,  which  obviates 
the  time-consuming  numerical  integration  of  the  SI.  The  RWG  basis  functions  are  employed 
to  provide  more  flexibility  to  model  the  arbitrary  shapes.  At  last,  some  multilayer  microstrip 
structures  are  analyzed.  The  agreements  with  measured  or  pulished  data  are  observed. 
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Figure  1:  A  multilayer  medium  with  source  and  field  points  in  layer  m  and  layer  n,  respectively. 


log{ko*rho)  log(ko’rho) 

Figure  2:  Green’s  functions  of  the  scalar  and  vector  potentials  in  three  layer  medium.  The  first 
layer  is  the  free  space.  The  second  layer  is  Teflon  with  relative  permittivity  er  =  2.1  and  thickness 
0.7  mm.  The  third  layer  is  GaAs  with  relative  permittivity  er  =  12.5  and  thickness  0.3  mm, 
which  is  backed  by  a  ground  plane.  The  frequency  is  30  GHz. 
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Figure  3:  Input  impedance  for  a  proximity-coupled  microstrip  antenna.  The  frequency  is  from 
2.80  GHz  to  3.15  GHz  with  increment  0.05  GHz. 


(a)  f=8.4  GHz 


(c) 


(b)  f=9.6  GHz 


Figure  4:  Current  distributions  and  S-parameters  for  a  two-port  asymmetric  antenna. 


225 


Modeling  Wideband  Conformal  Antennas  using  Finite  Element-Boundary  Integral 

Techniques 

Keith  D.  Trott  and  Rene  D.  Guidry 
Mission  Research  Corp 
147  John  Sims  Parkway 
(850)729-0855  (voice) 
trott@fwb.ouif.net 


Abstract 

Mission  Research  Corporation,  in  collaboration  with  Michigan  State  University  (MSU),  is 
applying  unique,  state-of-the-art  computational  electromagnetics  (CEM)  tools  to  investigate 
the  performance  parameters  of  wide  bandwidth,  multi-modal,  conformal  antennas.  These 
antennas  are  used  to  determine  angle-of-arrival  (AoA)  by  means  of  multi-modal  phase 
comparison.  That  is,  the  received  phase  difference  between  two  simultaneous  modes  of  a 
signal  from  a  single  aperture  yields  the  AoA.  We  will  discuss  our  efforts  to  develop  finite 
element  mesh  generators  for  these  types  of  antennas  and  the  use  of  these  special  meshes 
with  our  FE-BI  CEM  tools  to  predict  antenna  performance  parameters  such  as  radiation 
pattern  and  impedance.  Representative  data  will  be  discussed. 

Introduction 

The  MRC  Team’s  FE-BI  CEM  tools  permit  rapid  analysis  and  design  of  conformal  antennas, 
with  particular  emphasis  on  wide  bandwidth,  multi-modal  antennas  such  as  the  ones  used  in 
angle  of  arrival  (AoA)  or  direction  finding  (DF)  applications.  These  antennas  are  cavity- 
backed  and  flush-mounted  on  planar,  cylindrical,  and  even  doubly-curved  surfaces.  Our 
CEM  tools  are  based  on  the  finite  element-boundary  integral  (FE-BI)  method.  This  CEM 
technology  is  uniquely  capable  of  modeling  complex  geometrical  and  material  properties. 
Below  we  present  a  brief  summary  of  the  FE-BI  method. 

Finite  Element  Boundary  Integral  (FE-BI)  Method 

Hybrid  finite  element  methods  (HFEM)  have  been  explored  for  several  years,  most  often 
being  called  either  the  finite  element-boundary  integral  (FE-BI)  or  the  finite  element-integral 
equation  (FE-1E)  method.  Both  combine  finite  element  and  integral  equation  methods. 
Unfortunately,  a  general  FE-IE  method  is  not  currently  feasible  due  to  heavy  computational 
resource  demand.  In  such  a  case,  electric  and  magnetic  currents  are  needed  on  the  mesh 
boundary  as  well  as  the  three  components  of  the  interior  field.  All  IE  formulations  result  in  full 
matrices  due  to  global  coupling;  hence,  the  two  currents  on  the  mesh  boundary  impose  an 
excessive  computational  burden  in  terms  of  memory  and  compute  cycles.  Therefore,  FE-IE 
formulations  typically  are  restricted  to  specialized  situations  such  as  cavity-backed  patches 
or  slots  recessed  in  an  infinite  metallic  surface.  In  this  case,  the  FE-IE  method  can  be 
formulated  such  that  it  requires  only  one  current  (magnetic)  over  the  apertures  located  within 
the  ground  plane  or  circular  cylinder  [1].  Accordingly,  the  memory  and  computational 
requirements  associated  with  the  mesh  boundary  is  minimized.  In  addition,  very  large 
apertures  can  be  efficiently  modeled  by  exploiting  Toeplitz  symmetry,  present  in  the  IE  sub- 
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matrix,  provided  these  apertures  lie  in  a  metallic  ground  plane  or  circular  cylinder  and  uniform 
sampling  is  used  on  the  mesh  boundary.  Figure  1  illustrates  the  application  of  the  FE-IE  (or 
FE-BI)  method. 


Figure  1 :  Application  of  FE-BI  to  a  conformal  antenna.  Note  computational  volume. 
Anisotropic  Planar  FE-BI  Computer  Code 

MRC  developed  and  delivered  for  GUI  integration,  a  Fortran  computer  code,  called  PRISM, 
capable  of  modeling  anisotropic  antenna  loading  materials.  This  code  also  permits  modeling 
R-card  loads  at  the  top  of  any  layer  of  the  mesh.  PRISM  uses  right  prism  elements  to 
discretize  the  volume  as  well  as  to  expand  the  unknown  electric  field.  The  computer  program 
has  the  following  capabilities: 

1 .  Plane  wave  incidence. 

2.  Filamentary  feed  models. 

3.  Radar  cross  section  calculation. 

4.  Power  gain  calculation  for  radiation  analysis. 

5.  Voltage  at  feed  pins  due  to  plane  wave  reception. 

6.  Input  impedance  calculation. 

7.  Anisotropic  materials. 

8.  Resistive  (R-card)  treatments. 

Anisotropic  materials  can  be  specified  by  fully  populated  symmetric  or  asymmetric 
permittivity  or  permeability  tensors.  Validation  has  been  performed  by  comparison  with 
XBRICK  results  for  a  patch  antenna  for  both  scattering  and  radiation. 

The  MRC  Team  has  been  a  leading  innovator  in  custom  FE-BI  solutions  for  antenna 
analysis.  FE-BI  is  the  only  leading  CEM  technology  capable  of  providing  the  geometric 
fidelity,  efficiency,  and  accuracy  required  for  demanding  conformal  antenna  design  and 
characterization. 

Automatic  Mesh  Generators 

We  have  developed  a  stand-alone  C  language  code  for  each  type  of  antenna  we  are 
modeling.  These  codes  will  be  integrated  with  a  graphical  user  interface  (GUI)  currently 
under  design.  The  basis  of  the  mesh  generators  is  Camegie-Mellon's  (CMU's)  TRIANGLE 
Planar  Surface  Mesher.  It  is  written  in  C  (source  code  is  freely  available),  uses  X-Windows 
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interface  for  mesh  viewing,  and  we  currently  have  it  running  on  our  SGI  and  other  machines 
using  Linux  We  have  created  mesh  generators  for  the  log-periodic,  the  equiangular  spiral, 
the  Vivaldi  notch,  and  the  sinuous  spiral.  The  associated  polarization  of  the  circular  antennas 
is  given  below: 

1 .  Circular  log-periodic  (linear  polarization). 

2.  Spiral  (circular  polarization). 

3.  Sinuous  Spiral  (dual  circular  or  linear  polarization). 

Log-Periodic  Antenna  Comparison: 

AFRL/SNRP  built  a  log-periodic  antenna  and  measured  it  using  their  RASCAL  facility.  The 
antenna  was  then  modeled  using  our  FE-BI  code  and  compared  to  the  measurements  The 
results  were  very  good  and  were  presented  at  the  1998  URSI  Radio  Science  Meeting  [2]  and 
are  shown  below  in  Figure  2. 


Figure  2:  Comparison  of  measured  and  calculated  data  for  6  and  8  GHz 


Spiral  Mesh  Generator 

Mesh  generators  have  been  developed  for  both  two-arm  and  four-arm  equiangular  spiral 
antennas  (SPIRAL2  and  SPIRAL4  respectively).  The  user  can  specify  typical  spiral  antenna 
attributes  such  as  the  antenna  flare  rate,  maximum  amount  of  rotation  of  the  spiral  arms^  and 
initial  distance  between  the  antenna  arms.  The  codes,  SPIRAL2  and  SPIRAL4,  are  actually 
mesh  generator  interfaces  used  to  create  the  input  files  used  by  TRIANGLE  TRIANGLE 
creates  the  triangular  mesh,  based  on  the  input  parameters  specified  by  the  user  when 
SPIRAL2  or  SPIRAL4  is  run.  SHOWME,  also  developed  by  CMU,  is  used  to  view  the 
resulting  mesh.  Once  the  mesh  files  have  been  generated  by  TRIANGLE  an  MRC 
developed  code  (INTERFACE)  is  used  to  create  an  input  file  for  PRISM  (our  FE-BI  code) 
PRISM  is  then  used  to  calculate  various  electromagnetic  parameters  such  as  input 
impedance,  radiation  patterns,  and  far-field  patterns.  A  relatively  new  feature  of  PRISM  is 
the  ability  to  calculate  the  electric  surface  fields.  MRC  developed  codes,  such  as  EDGER, 
provide  an  ability  to  decompose  the  surface  field  into  components  (x,  y,  total,  rho,  or  phi). 
These  fields  may  then  be  viewed  in  the  MATLAB  environment  using  MRC  developed  codes 
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such  as  FIELDVIEW  or  PHASEVIEW.  Figure  3  provides  an  example  of  the  spiral’s  surface 
fields.  It  should  be  noted  that  these  plots  are  on  two  different  scales. 


Figure  3:  Surface  fields  for  a  2-arm  and  4-arm  Spiral 
Vivaldi  Notch  Antenna  Mesh  Generator: 

The  Vivaldi  Notch  program  creates  a  data  file  containing  the  xy-coordinates  of  the  antenna 
as  calculated  by  the  governing  Vivaldi  equation.  This  is  an  input  file  used  by  TRIANGLE  to 
form  the  mesh  and  set  attributes  of  the  geometry.  The  meshed  file  is  then  used  by  PRISM  to 
calculate  the  antenna  parameters.  As  mentioned  previously,  the  Vivaldi  Notch  antenna  is  a 
broadband  antenna.  Our  main  goal  is  to  investigate  the  behavior  of  the  surface  fields  as  a 
function  of  frequency.  We  expected  higher  frequencies  would  have  the  maximum  surface 
field  boundary  closer  to  the  feed  point,  and  the  boundary  would  move  outward  from  the  feed 
as  we  decreased  frequency.  This  is  demonstrated  in  our  plots.  The  geometry  was  modeled 
after  that  used  by  Eric  Thiele  and  Allen  Taflove  [3].  Figure  4  shows  examples  for  the  surface 
fields  in  a  Vivaldi  notch. 


Figure  4:  y-component  of  surface  fields  for  a  Vivaldi  notch 
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Sinuous  Spiral  Antenna 

The  sinuous  spiral  antenna  is  the  most  difficult  to  model.  This  antenna  is  defined  by  angles 
and  expansion  ratios.  Radii  are  related  by  the  expression 


Rp  -  tp-iRp-i,  0) 

where  tp  =  ratio  of  inner-to-outer  cell  radii  and  ap  =  angular  width  of  cells.  For  example,  R2  = 
ti  Ri.  This  ratio  can  be  constant;  hence,  the  inner-most  radius  would  be  equal  to  the  outer¬ 
most  radius  by  a  power  of  the  expansion  ratio.  A  sinuous  arm  is  formed  rotating  curve  ±  5 
about  the  origin  yielding  ap  ±  8.  The  aperture  design  is  self-complementary  when  a  =  45° 
and  5  =  22.5°.  The  polar  coordinates,  r  and  <j>,  of  the  sinuous  curve  are  given  by 


<t>  =  (-1)pa  sin 


180  ln(r/Rp) 


lnxn 


for  Rp+1  £  r  <  Rp . 


(2) 


The  details  are  shown  in  Figure  5.  The  angular  rotation  of  a  single  arm  through  360/N 
increments  forms  an  N-arm  structure.  Typical  four-arm  structure  develops  dual  linear  and 
dual  circular  polarization.  Orthogonal  linear  polarization  formed  by  feeding  the  opposite  arms 
180°  out  of  phase.  This  generates  two  rotationally  symmetric  beams  linearly  polarized  with 
respect  to  each  pair  of  arms.  Dual  circular  polarization  is  formed  by  combining  the  linear 
beams  with  a  ±  90°  hybrid. 


Figure  5:  Sinuous  spiral  geometry 

The  active  region  of  the  antenna  is  where  2(ap  +  8)r «  X/2  •  The  low  frequency  is  limited  by 
Ri;  hence,  the  low  frequency  limit  is  determined  by  Xt  =  4R1(ap+8).  The  high  frequency 
limit  is  bounded  by  feed  point  design.  The  smallest  segment  should  be  somewhat  smaller 
than  X/A ;  therefore,  the  upper  frequency  limit  is  defined  by  XH  =  8Rp(ap  +  8),  with  the  active 
region  moving  inward  with  increasing  frequency.  An  example  of  the  mesh  generated  by 
TRIANGLE  is  shown  in  Figure  6. 
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Figure  6:  Sinuous  mesh  generated  by  TRIANGLE 

The  output  using  this  mesh  in  our  PRISM  code  is  shown  in  Figure  7.  Note  the  fields  are 
somewhat  aligned  to  the  orthogonal  components. 


X  <<t  8  fOH? j  V  0>w3<xf  EUictnc  Surfeit  at  tfl  0  JOKs] 


Figure  7:  x  and  y-directed  surface  fields  of  sinuous  spiral  using  PRISM  code 


Antenna  Performance  Prediction 

We  have  discussed  our  efforts  to  develop  mesh  generators  for  wideband,  multi-modal 
antennas.  These  meshes  enable  accurate  and  efficient  modeling  of  the  performance  for 
these  antennas  using  the  FE-BI  method.  Research  continues  to  determine  design 
parameters  that  optimize  performance  of  these  antennas  to  solve  real-world  problems. 
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Abstract: 

-The  Northrop-Grumman  Duct  RCS  Prediction  code  is  based  on  the  curvilinear  hybrid  finite  element-surface  integral 
equation  technique  with  several  novel  enhancements  designed  to  cater  to  the  special  and  formidable  requirements  of 
duct  and  engine  inlet  analysis.  This  paper  focuses  on  the  unique  problems  encountered  while  applying  such  a  numerical 
technique  to  the  analysis  of  ducts  and  the  progress  made  in  circumventing  them.  First,  we  discuss  the  formulation  of  a 
Computed  Relationship  (CR)  which  enables  the  de-coupling  of  the  finite  element  and  surface  integral  equation  regions. 
Subsequently,  we  discuss  the  novel  features  of  a  banded  matrix  solver  developed  in-house,  which  yields  considerable 
savings  in  execution  time  and  storage  in  generating  the  CR  from  the  finite  element  system.  A  dense  linear  system  solver 
from  ScaLAPACK  is  used  to  solve  the  surface  integral  system  and  its  requirements  are  discussed  in  the  context  of  our 
problem.  Higher  order  basis  functions  to  improve  the  accuracy  of  RCS  predictions  or  equivalently  reduce  the  number  of 
discretization  unknowns  are  implemented.  Numerical  results  illustrating  the  code  performance  are  generated  on  multiple 
processors  of  a  high  performance  computing  architecture  -  the  SGI  Origin  2000. 


L  Introduction: 

The  basic  Northrop-Grumman  RCS  prediction  code  is  the  SWITCH  code  which  is  based  on  a 
hybrid  finite  element-surface  integral  equation  formulation  of  the  electromagnetic  scattering 
problem.  Recently  we  have  developed  a  special  RCS  code  to  predict  scattering  from  ducts  and 
engine  inlets.  This  duct  RCS  code  is  based  on  the  same  analysis  as  the  SWITCH  code,  however,  it 
separates  the  finite  element  equation  portion  of  SWITCH  from  its  surface  integral  equation  portion. 
The  Northrop-Grumman  duct  RCS  code  consists  of  two  codes  -  Sparse  and  Dense.  Sparse  is 
associated  with  the  finite  element  equation  and  Dense  is  related  to  the  surface  integral  equation  of 
the  discretized  Maxwell  equations. 

The  benefit  of  having  two  separate  codes  is  that  the  coefficient  matrix  associated  with  Sparse  is 
a  symmetric,  sparse  and  banded  matrix.  The  unknowns  in  Sparse  are  the  interior  (Nei)  and  the 
aperture  (Nes)  Electric  (E)  field  coefficients  and  the  aperture  magnetic  (H)  field  coefficients  (NhS). 
The  total  number  of  equations  in  sparse  is  Ne  (Ne=  Nei+Nes)  which  is  the  total  number  of  E-field 
unknown  coefficients.  Clearly  there  are  more  unknowns  than  equations  in  Sparse.  Sparse  is 
formulated  such  that  the  aperture  H  field  coefficients  are  treated  as  the  external  excitations  and  we 
solve  for  the  E  field  coefficients  in  terms  of  the  H  field  coefficients.  It  has  to  be  noted  that  for 
scattering  problems  we  do  not  need  the  interior  E  field  coefficients  for  far  field  computations.  The 
necessary  information  at  the  end  of  execution  of  Sparse  is  a  matrix  relationship  between  the 
aperture  E  field  and  aperture  H  field  unknown  coefficients.  We  name  this  matrix  the  CR  (for 
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Computed  Relationship)  matrix.  The  CR  matrix  with  Nes  rows  and  Nhs  columns  is  not  a  square 
matrix.  Also,  for  practical  problems  Nes *  Nh  and  Ne »  Nes. 

Section  II  describes  a  special  direct  solver  developed  at  Northrop-Grumman,  specifically  for  the 
banded  matrices  encountered  in  Sparse,  called  the  Northrop  Grumman  Direct  Sparse  Solver 
(NGDSS).  The  NGDSS  has  two  novel  features  -  fast  CR  matrix  generation  and  factored  matrix 
storage  reduction.  To  take  advantage  of  the  novel  features  of  NGDSS,  the  E  field  unknown 
coefficients  must  be  numbered  in  a  prescribed  manner  and  the  Reverse  Cuthill-McKee  (RCM) 
scheme  is  employed  for  this  numbering.  The  bandwidth  of  the  sparse  matrix  is  small  as  a  result  of 
using  the  RCM  scheme. 

The  walls  of  the  duct  of  interest  are  assumed  metallic,  hence  the  magnetic  current  vanishes  on 
the  walls.  The  surface  integral  equation  over  the  duct  exterior  is  comprised  of  the  duct  wall  and  the 
duct  aperture  opening  with  non-zero  magnetic  current.  The  aperture  magnetic  current  in  these 
regions  is  replaced,  via  CR  matrix  substitutions,  by  the  aperture  electric  current.  The  coefficient 
matrix  of  Dense  is  dense  and  square  of  order  Nh  where  Nh  (Nh  =  Nhw+  Nh)  is  the  total  number  of  H 
field  unknown  coefficients  which  includes  the  aperture  opening  and  the  duct  wall  unknown  J 
(electric  current)  coefficients. 

The  scattered  far  field  is  obtained  by  first  executing  Sparse  to  obtain  the  CR  matrix,  which  then 
serves  as  the  input  for  Dense.  RCS  of  the  duct  is  obtained  at  the  end  of  execution  of  dense. 

Section  III  describes  the  procedure  for  connection  of  the  coefficient  and  right-hand  side  matrices 
of  Dense  to  the  ScaLAPACK  [1]  dense  equation  solver.  ScaLAPACK  has  excellent  scaling  and 
portability  characteristics  on  a  wide  variety  of  distributed  memory  and  shared  memory  parallel 
computing  platforms. 

The  basis  and  testing  functions  in  Sparse  and  Dense  are  the  linear  roof  top  functions.  A  different 
duct  code  based  on  the  quadratic,  second  order  Lagrange  interpolation  polynomial  basis  functions 
has  also  been  developed.  For  a  given  number  of  unknowns,  the  quadratic  basis  function  code  is 
more  accurate  than  the  linear  roof  top  basis  function  code  and  will  be  discussed  in  Section  IV. 

Numerical  results  for  sample  duct  geometries,  obtained  from  computer  codes  utilizing  the 
proposed  analysis  schemes,  are  given  in  Section  V.  These  results  were  primarily  generated  on 
multiple  processors  of  the  SGI  Origin  2000  shared  memory  parallel  computer  located  at  the 
Aeronautical  Systems  Center  of  the  Major  Shared  Resources  Center  (ASC-MSRC),  Wright- 
Patterson  Air  Force  Base  (Ohio).  Sparse  was  parallelized  using  data-parallel  directives  and  hence 
was  based  on  a  shared  memory  programming  paradigm.  Dense  utilizes  MPI  (Message  Passing 
Interface)  thus  simulating  a  distributed  memory  paradigm  on  the  shared  memory  computer. 


D.  The  Northrop-Grumman  Direct  Sparse  Solver  (NGDSS) 

To  apply  NGDSS,  the  interior  E  field  unknown  coefficients  must  be  numbered  before  any  of 
the  aperture  E  field  unknown  coefficients  for  reasons  which  will  become  obvious.  The  RCM 
scheme  starts  with  the  aperture  E  field  unknown  coefficients  as  the  starting  unknowns.  Next,  the 
unknowns  that  are  the  immediate  (first  level)  neighbors  of  the  starting  unknowns  are  numbered. 
Second  level  neighbors,  immediate  neighbors  of  first  level  neighbors,  are  numbered  after  all  first 
level  neighbors.  This  process  is  continued  until  all  the  unknowns  are  numbered.  At  the  end,  the 
unknown  numbers  are  reversed  and  the  unknowns  associated  with  the  edges  of  the  aperture  opening 
have  the  highest  numbers. 

The  system  of  equations  (Ne)  associated  with  Sparse  can  be  expressed  as 
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In  equation  (1),  A  is  a  symmetric,  sparse,  square  matrix  of  order  Ne.  There  are 
Nhs  right-hand  sides  where  the  first  Net  rows  of  the  right-hand  side  matrix  are  zero  elements.  Since 
matrix  A  is  symmetric,  it  can  be  factored  using  the  LDLT  factorization  algorithm 
A  =  LDLt  (2) 

In  equation  (2),  L  is  a  lower  triangular  matrix  with  unit  values  on  the  diagonal,  D  is  a  diagonal 
matrix  and  LT  is  the  transpose  of  L  and  is  hence  an  upper  triangular  matrix. 

After  matrix  A  is  factored,  the  solutions  of  (1)  can  be  obtained  as  follows.  Let 
LXk=  bk  k=  1,2 ...  Nhs  (3) 

where  Xk  is  some  intermediate  solution  vector,  bkis  the  k*  column  of  the  right-hand  side  matrix. 
The  intermediate  solution  vectors  in  (3)  can  be  obtained  by  forward  substitution.  Let 
DYk=Xk  k  =  1,2  . . .  Nhs  (4) 

The  new  intermediate  solution  vectors  Yk  can  be  found  easily  since  D  is  a  diagonal  matrix.  Let 
LTEk=Yk  k=  1,2  ...Nhs  (5) 

Ek  in  (5)  is  the  solution  vector  of  (1)  corresponding  to  the  k*  column  of  the  right-hand  side  matrix. 
Ek  can  be  obtained  by  backward  substitutions. 

The  CR  matrix  generation  can  be  greatly  simplified  since  the  first  Ne,  rows  of  the  right-hand  side 
matrix  are  zeros.  Let 


0 

1  Nd 

0 

X  Nej 

0 

Xk  = 

Xk. 

Yk  = 

X  Ne 

YkJ 

bk  = 

t  Nc  | 

H* 

The  solution  procedure  given  by  equations  (3)-(5)  becomes 


LXk  =  Hsk 


D  Yk  =  Xk  k=l,2...Nhs  (7) 


£T  Esk  =  Yk 

Matrix  D  is  a  diagonal  matrix  of  rank  Nes ■  The  diagonal  elements  of  D  are  the  last  Nes  diagonal 

elements  of  the  diagonal  matrix  D.  Matrix  L  is  a  lower  triangular  matrix  with  unit  values  on  the 
diagonals  and  of  rank  Nes.  It  is  comprised  of  the  last  Nes  rows  and  columns  of  matrix  L. 

The  procedure  given  by  (7)  explains  the  first  novel  feature  of  NGDSS,  which  is  its  speed  in 
generating  the  CR  matrix  once  A  is  factored.  In  essence  it  solves  a  greatly  reduced  system  of 
equations.  The  reduction  of  CPU  time  by  using  (7)  is  of  order  (Nj/Ne2)  which  is  large  for  practical 
problems  of  interest. 

The  second  novel  feature  of  NGDSS  can  now  be  explained.  It  is  clear  from  equation  (7)  that 
only  the  last  Nes  rows  and  columns  of  the  lower  triangular  matrix  L  (i.e.  L)  and  the  last  Nes  diagonal 

elements  of  the  diagonal  matrix  D  (i.e.  D  )  need  to  be  stored  for  obtaining  the  CR  matrix.  The 
matrix  A  is  factored  one  block  at  a  time.  In  Figure  1  let  nc  be  the  number  of  columns  per  block  and 
nbcmd  be  the  bandwidth  of  matrix  A.  The  memory  required  to  store  a  block  of  matrix  elements  is 


234 


nc*ribcmd  complex  numbers.  It  is  not  required  to  store  the  factored  matrix  elements  separately  from 
the  coefficient  matrix  A  since  the  coefficient  matrix  elements  are  overwritten  by  the  factored  matrix 
elements.  The  factored  matrix  elements  in  the  first  block  also  partially  factor  the  first  nbcmd 
columns  of  the  next  block.  The  total  storage  is  therefore  (nc+ nband)*  nband  complex  numbers  for 
the  lower  triangular  matrix  L.  In  addition  we  need  storage  of  nc+nband  complex  numbers  for  the 
diagonal  matrix  D. 

The  procedure  to  factor  matrix  A  with  storage  reduction  works  in  the  manner  described 
below.  First  the  matrix  elements  of  the  first  block  of  coefficient  matrix  A  are  computed. 

This  block  is  factored  column  by  column.  Once  a  column  is  factored,  the  partial  factorizations  of 
subsequent  columns  due  to  this  column  are  computed.  The  number  of  subsequent  columns  is  nbcmd. 
This  process  is  continued  until  the  last  column  (nc)  is  factored  and  the  partial  factorizations  due  to 
this  last  column  have  been  computed.  The  first  nbcmd  columns  of  the  second  block  have  already 
been  partially  factored  even  though  the  coefficient  matrix  elements  of  the  second  block  have  not 
been  calculated.  Next,  the  index  for  matrix  elements  are  shifted.  The  first  nc  columns  of  factored 
matrix  A  (the  first  block)  can  be  discarded.  The  last  nbcmd  columns  of  the  first  block  become  the 
first  nbcmd  columns  of  the  second  block.  The  original  matrix  elements  of  the  second  block  can  be 
.computed  and  superimposed  on  the  partially  factored  matrix  elements.  The  second  block  is  factored 
column  by  column  similar  to  the  procedure  used  for  the  first  block.  This  process  continues  until  the 
whole  matrix  is  factored. 

The  memory  requirement  for  the  above  procedure  is  ( nc+nbcmd)*(nbcmd+l )  complex 
numbers.  The  RCS  of  a  duct  of  any  size  can  be  analyzed  provided  that  nband  is  not  large.  Also,  no 
matter  how  small  nc  is,  for  very  large  nband  values,  (nc+ nband)* (nbcmd+1)  complex  numbers 
cannot  be  stored  in  memory.  For  these  cases  an  out-of-core  NGDSS  has  been  written  but  will  not  be 
discussed  here. 

HI.  Connection  of  Dense  to  ScaLAPACK  dense  solver: 

The  CR  matrix  from  Sparse  will  be  used  to  replace  the  aperture  E  field  unknown  coefficients 
by  the  aperture  H  field  unknown  coefficients.  The  problem  size  of  Dense  is  the  total  number  of 
exterior  and  aperture  H  field  unknown  coefficients. 

ScaLAPACK  is  a  library  of  computational  routines,  which  utilize  Message  Passing  Interface 
(MPI)  for  inter-processor  communication.  Dense  was  restructured  to  conform  to  MPI  in  order  to 
connect  it  to  ScaLAPACK.  A  two-dimensional  block  cyclic  data  distribution  is  employed  by 
ScaLAPACK  which  yields  an  excellent  compromise  between  load  balancing  and  the  ability  to  use 
level  BLAS3  kernels  for  efficient  computation  on  individual  processors.  Such  a  data  distribution 
requires  that  the  entries  of  the  matrix  be  allocated  to  abstract  processes  along  its  rows  and  columns 
such  that  the  product  of  these  processes  is  equal  to  the  total  number  of  available  physical 
processors.  If  np  is  the  total  number  of  available  processors,  it  can  be  expressed  as  a  product  of 
processes  along  the  matrix  row  ( nprow )  and  column  ( npcol )  as 

np  =  nprow*npcol  (8) 

Ideal  load  balancing  requires  that  nprow  and  npcol  should  be  equal  thereby  necessitating  that  np  be 
a  perfect  square.  However,  small  deviations  from  square  values  for  np  does  not  deteriorate  load 
balancing  appreciably  and,  in  general,  the  user  specifies  nprow  and  npcol  based  on  processor 
availability.  The  coefficient  and  right  hand  side  matrices  are  partitioned  with  two-dimensional 
blocks  such  that  each  processor  computes  and  stores  only  a  fraction  of  the  total  matrix.  The  two- 
dimensional  blocks  belonging  to  a  particular  processor  are  stored  contiguously  thus  leading  to  a 
rectangular  local  array  of  dimensions  mow  by  ncol.  Each  matrix  element  is  thus  addressed  by  a 
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global  index  corresponding  to  the  entire  matrix  and  a  local  index  corresponding  to  the  local  two- 
dimensional  array  of  each  processor.  Integer  arrays  are  used  to  connect  the  two  sets  of  indices.  It 
should  be  noted  that  the  values  of  nrow  and  ncol  might  vary  for  each  processor.  The  optimum  two- 
dimensional  block  is  square,  with  order  64,  on  most  high-performance  computing  architectures. 

The  memory  requirement  for  Dense  is  significant  since  each  processor,  requires,  in  addition 
to  the  coefficient  and  right  hand  side  matrices,  the  CR  matrix  and  the  matrix  coefficients  associated 
with  the  aperture  magnetic  current.  The  memory  requirement  is  alleviated,  by  partitioning  the  CR 
matrix,  such  that  each  processor  stores  only  the  elements  required  by  it  in  order  to  perform  CR 
substitutions.  Partitioning  yields  npcol  sub-matrices  thus  reducing  the  storage  requirement  of  each 
processor  by  1  /npcol.  Processors  with  the  same  row  number  in  the  two-dimensional  process  array 
store  the  same  elements  of  the  CR  matrix. 

The  memory  requirement  due  to  the  aperture  magnetic  current  is  alleviated,  by  defining  a 
one-dimensional  complex  array  of  size  Nes.  The  testing  function  and  associated  two  patches 
corresponding  to  a  local  or  global  row  number  are  obtained.  The  coefficients  associated  with  the 
aperture  magnetic  number  for  the  same  row  number  are  computed  and  stored  in  the  array.  The  CR 
matrix  substitution  for  any  global  column  number  corresponding  to  the  aperture  electric  current  is 
.  performed.  This  complex  array  can  be  re-used  for  different  row  numbers  thereby  greatly  reducing 
the  memory  requirement  for  the  aperture  magnetic  current.  This  reduction  is  at  the  cost  of  matrix 
build  time,  which  increases  by  approximately  a  factor  of  two. 

IV.  Higher  Order  Basis  Functions  for  Duct  Analysis 

RCS  prediction  accuracy  can  be  greatly  improved  by  using  higher  order  basis  functions  at 
the  cost  of  longer  matrix  build  times.  However,  the  total  build  time  continues  to  be  insignificant  due 
to  the  small  number  of  non-zero  elements  in  the  system  matrix  of  Sparse.  Higher  order  basis 
functions  increase  the  sparse  matrix  bandwidth  and  consequently  the  CPU  time  for  factorization. 
The  significant  matrix  build  time  of  Dense  is  further  increased  by  the  application  of  higher  order 
basis  functions.  Using  linear  roof  top  basis  functions  for  testing  alleviates  this  problem,  with  little 
deterioration  in  accuracy. 

Second  order  Lagrange  interpolation  polynomials  (quadratic)  were  the  adopted  higher  order 
basis  functions,  leading  to  a  sparse  matrix  with  twice  the  bandwidth  and  consequentially 
quadrupling  the  factorization  time  for  equal  number  of  unknowns  when  compared  with  rooftop 
basis.  Quadratic  basis  functions  can  be  explained  with  the  aid  of  Figure  2  where  a  curved  patch  has 
been  transformed  into  a  unit  square  in  the  u-v  plane.  Four  (2,5,8  and  1 1)  of  the  twelve  unknowns 
associated  with  the  patch  are  interior  unknowns  and  have  their  concomitant  basis  functions  defined 
over  the  patch  only.  The  rest  (1,3,4,6,7,9,10,12)  are  boundary  unknowns  with  their  basis  functions 
defined  both  over  the  patch  in  the  figure  and  its  neighbor  sharing  the  boundary  edge.  The  basis 
function  associated  with  edge  1  is  given  by 
fi  =  (1-u)  (l-2v)  (1-v)  (9) 

indicating  a  quadratic  and  linear  variation  in  the  transverse  and  longitudinal  directions  respectively. 
Higher  order  basis  function  definition  in  Sparse  is  similar  to  (9)  resulting  in  54  unknown 
coefficients  per  finite  element. 

V.  Results  and  Conclusions 

Figure  3  depicts  the  predicted  RCS  for  a  PEC  duct  with  a  square  cross-section  of  side 
3  wavelengths,  length  10  wavelengths  and  gridded  at  10  unknowns  per  wavelength.  This  geometry 
yields  a  problem  size  of 258100  for  Sparse  with  a  bandwidth  of 2640  and  a  CR  matrix  of  1740  ( Nes ) 
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by  1860  (Nhs).  The  system  matrix  was  built,  factored  and  the  CR  matrix  computed  in  50  minutes  on 
a  SGI  Origin  2000  with  32  processors.  The  duct  exterior  was  discretized  at  7  and  10  unknowns  per 
wavelength  in  the  longitudinal  and  transverse  direction  respectively  yielding  20400  unknowns  for 
the  electric  current.  Thus  a  completely  dense  system  of  order  20400  was  built,  factored  and  solved 
in  75  minutes  on  a  SGI  Origin  2000  with  32  processors.  On  64  processors,  the  same  system  was 
solved  in  40  minutes  resulting  in  an  excellent  parallel  efficiency  of  93.75%. 

Figure  4  depicts  the  predicted  RCS  for  a  similarly  shaped  duct  with  a  cross-section  of  side  4 
wavelengths  and  length  9  wavelengths.  Problem  size  for  Sparse  was  417690  with  a  bandwidth  of 
4720,  CR  matrix  was  of  size  3120  (Nes)  by  3280  (Ay,  yielding  a  run-time  of  7  hours.  The 
corresponding  fully  populated  dense  system  of  order  26560  was  solved  in  2.5  hours. 

The  improved  accuracy  of  higher  order  basis  functions  is  demonstrated  with  reference  to  the 
EMCC  duct  which  at  9.46  GHz,  has  a  square  cross-section  with  side  2  lambda  and  is  3  lambda  long. 
RCS  predictions  with  the  moment  method,  applied  by  discretizing  at  a  rate  of  16  unknowns  per 
wavelength  resulting  in  14272  unknowns,  agree  very  well  with  measured  data  and  will  serve  as  the 
benchmark  for  comparisons.  With  higher  order  basis  functions  the  discretization  was  5  unknowns 
per  wavelength  in  the  transverse  direction  and  7  unknowns  per  wavelength  in  the  longitudinal 
.direction  resulting  in  a  total  of 2000  unknowns  for  Dense  and  3915  for  Sparse.  Figure  5  shows  that 
RCS  predictions  with  higher  order  basis  functions  for  horizontal  polarization  agree  almost  exactly 
with  the  moment  method.  Figure  6  depicts  RCS  for  the  same  number  of  unknowns  while  using  sub- 
domain  basis  functions,  clearly  illustrating  the  higher  accuracy  of  higher  order  basis.  Figure  7 
compares  predictions  with  sub-domain  basis  employing  10  unknowns  per  wavelength  (totals  of 
6400  for  Dense  and  33630  for  Sparse)  and  the  moment  method  predictions  form  Figure  5.  Thus, 
Figures  5-7  illustrate  that  RCS  computed  with  higher  order  basis  functions  with  only  2000 
unknowns  is  more  accurate  than  that  computed  with  sub-domain  basis  functions  using  6400 
unknowns. 

A  procedure  to  efficiently  predict  the  RCS  of  ducts  of  arbitrary  shape  and  material 
composition  was  discussed.  The  formulation  of  a  computed  relationship  and  its  generation  from  the 
finite  element  system  was  given.  A  novel  banded  matrix  solver  was  developed  to  rapidly  generate 
the  CR.  The  surface  integral  equation  dense  system  was  solved  with  a  linear  equation  solver  from 
ScaLAPACK.  Higher  order  basis  functions  are  implemented,  which  reduce  the  required 
discretization  rate.  Numerical  results  to  validate  the  technique  were  presented.  Future  work  includes 
improving  the  efficiency  of  the  banded  matrix  solver  and  analysis  of  ducts  of  arbitrary  shape  and 
material  composition. 
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Figure  3:  RCS  of  a  square  c-s  duct  of  side  3a  and  length  10a 
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Figure  4:  RCS  of  a  square  c-s  duct  of  side  4X  and  length  9X 


PEC  Cavity  f=9.46Ghz  0.15*0.2*0.2 
Solid->MOM  Subdomain  basis  functions, 14272 
Dotted->Hybrid  Higher  order  basis  functions, 2000 
tha=90,  HH 


-20.0 

-20.0 

-30.0 

-31.0 


-40.04 


A 

/ 

\ 

^ _ _  s 

\ 

/ 

J 

N-'X/ 

/ 

UI - 

V\ , 

l\ 

I 

ft 

1 

\l  1 

il  1 

1 

u 

nt 

III" 

tf  1 

!  1  1 

d. 

1 

I  1 

I) 

L 

LL 

i 

1 

n 

Phi  in  deg. 


Figure  5:  Reference  RCS  predictions  compared  with  predictions  from  hybrid  higher  order  basis 
functions  using  2000  unknowns 
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Figure  6:  Reference  RCS  predictions  compared  with  predictions  trom  hybrid  subdomain  basis 
functions  using  2000  unknowns 
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Figure  7:  Reference  RCS  predictions  compared  with  predictions  from  hybrid  subdomain  basis 
functions  using  6400  unknowns 
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Abstract 

Although  the  methods  and  techniques  associated  with  hybrid  finite  element-boundary  integral  method 
have  been  available  in  the  literature  for  over  ten  years,  practical  advice  involving  the  implementation 
of  these  techniques  as  efficient  computer  code  is  not  generally  available.  The  author  has  recently 
developed  computer  programs  using  three  popular  finite  elements:  bricks,  right  prisms,  and  tetrahedra. 

In  this  paper,  specific  recommendations  are  made  to  potential  finite  element-boundary  integral 
computer  program  developers.  Topics  discussed  include:  specific  advantages  and  disadvantages  of 
each  element,  suggested  element  expansion  functions  for  use  with  anisotropic  materials,  issues 
involved  in  coupling  the  interior  finite  element  formulation  to  the  exterior  boundary  integral 
formulation,  and  comments  on  matrix  solution  techniques. 

Introduction  ... 

The  finite  element-boundary  integral  method  has  been  used  in  the  electromagnetics  analysis 
community  for  more  than  two  decades,  especially  for  electrostatics  and  magnetostatics  analysis. 
Solution  of  dynamic  problems,  such  as  antenna  characterization  or  radar  cross  section  calculation, 
required  the  introduction  of  edge-based  (also  known  as  vector)  finite  elements.  However,  even  with 
the  use  of  vector  finite  elements,  implementations  of  a  hybrid  finite  element  method  were  scarce  and 
limited  in  their  utility. 

Hybrid  finite  element  methods  utilize  a  boundary  integral  to  close  the  finite  element  mesh  by  providing 
the  relationship  between  the  tangential  electric  and  magnetic  fields  on  the  surface  of  the  mesh.  The 
boundary  integral  explicitly  couples  each  unknown  (or  edge)  on  the  surface  to  all  the  other  unknowns 
resulting  in  a  fully-populated  matrix.  Hence,  this  hybrid  finite  element-boundary  integral  (FE-BI) 
method  had  limited  utility  in  an  era  of  Intel  386-based  computers  with  upto  32  MB  of  RAM. 

However,  an  important  milestone  was  reached  in  the  development  of  useful  finite  element-boundary 
integral  computer  programs  when  BRICK  was  released  by  Jian-Ming  Jin  and  John  Volakis  [1].  These 
authors  realized  that  if  brick  elements  were  utilized  to  mesh  a  rectangular  volume  recessed  in  a 
metallic  groundplane,  the  resulting  uniformly  discretized  surface  aperture  results  in  a  boundary  integral 
matrix  that  is  block  Toeplitz.  Accordingly,  there  is  tremendous  redundancy  in  the  boundary  integral 
matrix  so  that  only  a  small  fraction  of  the  fully  populated  matrix  need  be  stored.  In  addition,  if  the 
matrix  is  stored  in  a  convenient  manner,  iterative  solutions  of  the  matrix  can  be  accelerated  through  the 
use  of  Fast  Fourier  Transforms.  Such  a  solution  made  the  hybrid  FE-BI  method  attractive  for  use  on 
commonly  available  computers.  Indeed,  even  with  contemporary  computer  resources,  use  of  the 
AFRL’s  XBRICK  computer  program  (see  http://www.adinc.com)  is  as  strong  as  ever. 
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As  computer  resources  have  improved  (today,  one  can  purchase  a  dual  processor  computer  with  1  GB 
of  RAM  for  less  than  $10,000),  the  limitations  of  BRICK  have  been  alleviated  by  eschewing  the 
memory  and  computational  advantages  of  a  Block  Toeplitz  boundary  integral  matrix.  Specifically, 
computer  program  designers  are  utilizing  triangular  surface  meshes  and  either  right  prism  or  tetrahedra 
meshes  to  solve  rather  complex  antenna  design  problems.  In  doing  so,  these  methods  involve  a  fully- 
populated  boundary  integral  sub-matrix  and  hence  it  is  important  to  carefully  design  the  computer 
program  to  avoid  wasting  resources.  Although  the  use  of  "fast  solvers"  such  as  AIM  or  FMM  is  not 
within  the  scope  of  this  paper,  it  should  be  noted  that  several  research  groups  are  now  using  AIM  or 
FMM  to  accelerate  hybrid  FE-BI  solutions  while  reducing  memory  consumption  [2]. 

In  the  remainder  of  this  paper,  practical  implementation  tips  are  provided  in  the  hope  of  saving 
valuable  development  time  for  those  who  wish  to  explore  the  capabilities  of  FE-BI  methods.  As  a 
general  rule,  the  methods  described  below  assume  the  following: 

1 .  planar  aperture  (e.g.  antenna  aperture  in  a  ground  plane), 

2.  total  electric  field  as  the  unknown  quantity, 

3.  anisotropic  cavity  fill,  volume  element  extrusion  from  surface  element  specification, 

4.  and  the  use  of  similar  basis  functions  (e.g.  the  volume  basis  function  collapses  to  the 
surface  basis  function  in  the  aperture). 

We  begin  with  general  implementation  comments  that  are  relevant  regardless  of  the  element  shape. 

General  Implementation  Comments 

Several  comments  may  be  made  regardless  of  the  finite  element  shape  chosen  for  implementation. 
These  comments  are  associated  with  common  elements  of  the  FE-BI  method.  The  FE-BI  equations  for 
a  total  electric  field  formulation  may  be  written  as 

jv[Vx W,  •*-'  VxW,]dV-kijv[w,  -fc.WJ]dV+jkJSj 

-kj|s  Js  [w,  ■  zx (5^  y z-  wJdS'dS  =  f‘”  +  f™ 

where  the  first  term  is  associated  with  the  curl  of  the  basis  function  (the  magnetic  field),  the  second 
term  is  associated  with  the  basis  function  itself  (the  electric  field),  the  third  term  is  necessary  to 
account  for  an  resistive  transition  conditions  present,  and  the  last  term  on  the  left-hand  side  is  the 
boundary  integral  term  (involving  a  second  kind  electric  field  dyadic  Green’s  function).  As  shown,  the 
placement  of  the  material  parameters  (fr,pr)  suggests  that  the  basis  functions  (wj  and  test  functions 
(Wj  should  be  chosen  so  that  the  dot  products  with  the  material  tensors  are  accomplished  readily. 
The  basis  functions  presented  herein  were  chosen  to  accomplish  this  goal. 

An  additional  observation  is  that  the  boundary  integral  term  does  not  involve  the  material  parameters. 
Accordingly,  since  the  basis  functions  and  test  functions  are  functionally  identical  (e.g.  Galerkin’s 
testing  procedure  was  used)  and  the  dyadic  Green’s  function  is  symmetric,  G^=G^  and 

Ge2(r,r')=  Ge2(r',r),  then  the  boundary  integral  sub-matrix  is  symmetric.  Therefore,  it  is  not 
necessary  to  store  either  the  lower  or  upper  triangle  of  that  matrix.  Since  the  BI  matrix  is  by  far  the 


(S.xW^fi.xWj) 
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greatest  consumer  of  RAM  in  the  FE-BI  method,  this  savings  of  roughly  —  complex  numbers  is 


well  worth  the  slightly  more  complex  coding.  Note  that  the  exception  to  this  rule  is  running  the  code 
on  certain  vector  processors  where  the  uneven  vector  length  associated  with  such  storage  is  inefficient. 
However,  for  the  majority  of  the  community  who  commonly  run  on  workstations,  symmetric  BI 
storage  should  always  be  used  since,  if  the  problem  does  not  fit  on  the  machine,  efficient  execution  is 
irrelevant. 


On  the  other  hand,  since  it  is  desirable  to  allow  for  the  possibility  of  anisotropic  materials  within  the 
computational  volume,  the  sparse  finite  element  matrix  should  be  stored  without  symmetry 
considerations.  Accordingly,  it  is  recommended  that  the  finite  element  and  boundary  integral  matrices 
be  stored  separately  and  hence  that  an  iterative  solver  (e.g.  BiCG,  CG,  GMRES)  be  used.  The  most 
significant  source  of  computational  cost  in  an  iterative  solver  is  the  matrix-vector  multiply.  This 
operation  can  be  accomplished  with  separate  matrix  storage  by  realizing 


Abb  +  YB] 
Am 


AnJlE, 


E, 
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where  "A”  matrices  denote  finite  element  interactions  and  the  "Y"  matrix  indicates  the  boundary 


integral  matrix.  The  subscript  "B"  denotes  a  boundary  edge  while  "I"  indicates  an  interior  edge.  This 
decomposition  of  the  finite  element  and  boundary  integral  matrices  allows  the  code  designer  to  used 
optimized  matrix  storage  and  multiplication  schemes  for  each  portion. 


Another  common  element  to  these  hybrid  finite  element  computer  programs  is  the  need  for  a 
volumetric  mesh.  If  tetrahedra  are  used  for  the  volume  mesh,  a  commercial  mesh  generating  package 
such  as  SDRC  IDEAS  may  be  used.  However,  for  many  applications,  an  extruded  mesh  will  work 
perfectly  well  without  the  need  for  additional  investment.  Extrusion  is  accomplished  by  forming 
elements  for  each  layer  of  the  mesh  by  replicating  the  nodal  distribution  of  the  aperture  for  all  lower 
layers  of  the  mesh.  Hence,  to  form  the  first  layer  of  elements,  the  aperture  nodes  are  replicated  at  the 
interface  of  the  first  and  second  layer.  Elements  are  formed  from  those  nodes  (and  the  previous  layer’s 
"bottom"  nodes)  and  edges  are  then  formed  based  on  the  chosen  finite  element.  Such  meshes  are  fairly 
rigid  and  hence  cannot  represent  the  wide  range  of  geometries  that  can  be  modeled  using  a  free  mesh 
(such  as  that  produced  by  SDRC  IDEAS). 


For  the  remainder  of  the  paper,  idiosyncrasies  of  three  typical  finite  element  shapes  are  examined. 


Brick  Elements 

Brick  elements  have  minimal  flexibility  as  a  finite  element  (e.g.  they  are  only  geometrically  suitable 
for  rectangular  volumes);  however,  they  are  rather  simple  to  implement.  Figure  1  illustrates  the  edge- 
based  brick  element  consisting  of  eight  nodes  and  twelve  edges.  Since  the  brick  is  defined  in  the  usual 
Cartesian  coordinate  system,  it  is  well  defined  for  representing  fields  in  a  rectangular  cavity  that  is 
filled  with  material  with  Cartesian  defined  anisotropic  material. 
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Figure  1 .  Brick  edge-based  finite  element. 

This  brick  element  has  twelve  basis  functions  defined  as: 

W,(x,y,z)=Wx(x,y,z;-,yu,z1,~)  W2(x,y,z)=  Wx(x,ytz;-,y,,z15+) 

W5  (x,  y ,  z)  =  Wx  (x,  y,  z;-,  y u ,  zu ,+)  W6  (x,  y, z)  =  Wx  (x,  y, z;-,  y , , z u ,-) 

W3 (x,y,z)=  Wy (x, y,z;xu ,-,z, ,-)  W4 (x, y, z)=  Wy (x, y,z;  x^z, ,+) 

W7 (x, y, z)=  Wy (x, y,z;xu ,-,zu ,+)  W8 (x, y,z)=  Wy (x, y,z; x, ,-,zu ,-) 

W9 (x, y,z)=  Wz (x, y,z;xu,yu ,-,+)  W10(x, y,z)=  Wz (x,y,z; x, , y „ ,•  ,-) 
Wn(x,y,z)=Wz(x,y,z;xu,y W12(x,y,z)=Wz(x,y,z;xpy 
where  the  three  fundamental  Cartesian  basis  functions  are  given  by 

Wx(x,y,z;x,y,z,s)=-^-(y-yXz-z) 

be 

W  (x,y,z;x,y,z,s)=— (x-xXz-z) 
ac 

Wz  (x,  y ,  z;  x ,  y ,  z,  s  (x  -  x  Xy  -  y  ) 

ab 

Note  that  each  basis  function  is  associated  with  a  sign  that  caused  the  sum  of  the  two  basis  functions 
associated  with  each  edge  (across  two  adjoining  bricks)  to  form  a  roof-top  basis.  These  signs  could  be 
omitted  by  defining  the  local  edges  with  the  signs  in  place;  however,  this  leads  to  additional 
programming  for  computation  of  element  matrices. 

Comments  on  Implementing  Bricks 

Bricks  are  rather  easy  to  implement  efficiently  due  to  the  inherent  rigidity  of  the  mesh.  One  a  uniform 
discretization  is  chosen  for  the  aperture  (e.g.  fixing  a  and  b  above),  the  only  geometrical  degree  of 
freedom  remaining  is  the  cell  thickness  (c).  Hence,  the  element  matrix  is  identical  for  each  layer  of  the 
mesh  and  accordingly,  only  two  12x12  element  matrices  need  be  computed  stored  for  each  layer 
regardless  of  the  material  distribution.  In  addition,  since  all  the  edges  are  either  parallel  or 
perpendicular  to  each  other,  the  element  matrices  are  simplified.  The  boundary  integral,  as  mentioned 
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before  if  the  edges  in  the  aperture  are  all  co-planar  and  uniformly  spaced,  results  in  a  block  Toeplitz 
matrix  that  can  be  manipulated  using  Fast  Fourier  Transforms.  In  addition  to  reduced  computational 
complexity,  the  block  Toeplitz  matrix  possesses  sufficient  symmetry  and  redundancy  so  that  only  one 
full  row  and  one  partial  row  of  the  matrix  need  be  computed  and  stored. 

Further  details  regarding  the  element  matrices,  boundary  integral  matrix  storage,  and  use  of  Fast 
Fourier  Transforms  in  computing  matrix-vector  products  is  given  in  [2].  Software,  including  a  fully 
functional  FE-BI  computer  program  using  bricks  and  the  BiCG-FFT  solver  is  available  for  download 
at:  http://www.eecs.umich.edu/RADLAB/bookprograms.html 


Right  Prism  Elements 

The  vector  edge-based  expansion  functions  used  for  right  prisms  are  developed  by  multiplying  the 
traditional  Rao-Wilton-Glisson  (RWG)  basis  function  [3]  with  a  function  of  the  prism  height  for  the 
transverse  (e.g.  x-  and  y-component)  functions.  The  normal  functions  are  simply  the  node-based 
simplex  basis  function  (see  for  example  [4])  multiplied  by  z .  Hence,  the  transverse  basis  functions  for 
edges  on  the  top  of  the  prism  are  given  by 


Wv 


=  f-^'b,=(^l^[(x-x1)?-(y-y1X]  X - 1.2,3 


Az,2S 


where  %  is  the  local  edge  number  and  i  is  the  global  edge  number.  Not  that  the  local  edge  numbers  are 
defined  so  that  the  edge  is  opposite  the  local  node  number  as  shown  in  Figure  2. 


Figure  2.  Illustration  of  local  node  and  edge  numbering  for  a  triangle. 
Transverse  expansion  functions  on  the  bottom  of  the  prism  are  given  by: 


Finally,  the  normal  (z-directed)  expansion  functions  are  given  by: 


K 


*  (*kiyic2  Yk2)X  +  (Xlc2  ~~Xkl)y  y=7g9 

■Z  2Se  * 


where  the  indices  kl  and  k2  are  given  in  the  following  table 
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X 

kl 

k2 

7 

2 

3 

8 

3 

1 

9 

1 

2 

As  stated  previously,  the  choice  of  basis  functions  is  dictated  by  the  physical  constraints  of  the 
geometry,  the  requirements  for  representation  of  a  field,  and  convenience  for  implementation.  The 
later  consideration  is  understood  by  considering  a  version  of  the  basis  functions  given  in  Chapter  5  of 
[2],  In  that  presentation,  the  basis  functions  are  represented  in  a  local  coordinate  system  rather  than 
the  global  treatment  given  above.  Although  the  solutions  are  identical,  inclusion  of  anisotropic 
materials  is  more  difficult  if  the  basis  functions  are  expressed  in  local  coordinates  since  a 
transformation  back  to  global  coordinates  is  required  to  perform  the  necessary  dot  products.  Hence, 
the  form  of  the  basis  functions  given  above  is  more  convenient  for  implementation  than  the  form  given 
in  [2]. 

Comments  on  Implementing  Right  Prisms 

Right  prisms  are  moderately  difficult  to  implement  since  a  surface  mesh  is  required  that  may  be  rather 
complex.  Various  sources  of  surface  meshes  are  available  ranging  from  academic  products  (such  as 
CMU’s  TRIANGLE  code)  to  commercial  products  (such  as  SDRC  IDEAS).  However,  once  the 
surface  mesh  is  provided,  extrusion  of  prism  elements  is  straightforward.  Computation  of  element 
matrices  is  tedious,  though  functionally  simple.  Although  the  boundary  integral  matrix  is  fully 
populated,  it  is  symmetric;  thus  only  the  upper  (or  lower)  triangle  need  be  computed  and  stored.  The 
matrix-vector  product  requires  more  time  than  the  case  of  a  uniformly  discretized  brick  code  (since  the 
latter  can  use  FFTs). 

Tetrahedral  Elements 

The  third  element  shape  investigated  in  this  paper  is  the  tetrahedra.  Before  presenting  the  basis 
functions,  a  discussion  of  creating  the  mesh  is  in  order.  A  commercial  mesh  generator  such  as  IDEAS 
can  be  used;  however,  tetrahedra  can  also  be  formed  from  prisms  and  bricks.  For  the  case  of  bricks  (or 
in  general  any  hexahedral  element),  each  hexahedra  can  be  broken  into  five  tetraheda.  A  good 
discussion  of  this  process  is  available  from  the  University  of  Missouri-Rolla  via  the  World-Wide  Web 
at:  http://www.emclab.umr.edU/emap4/Meshing.html#mesh  gen.  For  the  case  of  prisms,  each  prism 
can  be  divided  into  three  different  tetrahedra.  Two  different  subdivision  schemes  are  required  for  both 
the  hexahedra  and  prisms.  This  is  to  allow  diagonal  edges  on  adjacent  elements  to  align  with  each 
other  so  that  on  any  face  of  the  element,  crossed  edges  do  not  occur.  Figure  3  and  Figure  4  illustrate 
the  subdivision  of  two  types  of  right  prism  into  three  tetrahedra. 
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Figure  3.  Subdivision  of  prism  type  1  into  three  tetrahedra. 


Figure  4.  Subdivision  of  prism  type  2  into  three  tetrahedra. 

In  these,  local  edge  1 1  has  an  opposite  orientation  in  the  two  prism  types  and  this  allows  the  alignment 
of  adjacent  prism  diagonal  edges.  Note  that  the  user  may  need  to  rotate  a  prism  to  accomplish  the 
required  alignment. 

Now  that  the  tetrahedra  have  been  created,  it  is  necessary  to  expand  the  unknown  fields  within  each 
element  in  terms  of  a  suitable  basis  function.  To  simplify  the  implementation  of  anisotropic  material, 
the  form  of  these  basis  functions  reported  in  [5]  is  recommended.  The  vector  basis  function  is  defined 
as 

W,  =  (Lj  VL:  -Lf  VL=  )  = 

where  the  subscripts  refer  to  the  two  local  node  numbers  that  define  the  edge  (edge  is  directed  from 
node  jj  to  j2),  lj  is  the  length  of  the  j*  edge,  and  the  nodal  basis  functions  are  given  by 

Li=i^(a'+blx+c')'+d'z) 

In  this,  V6  represents  the  volume  of  the  tetrahedral  element  and  the  coefficients  a,b,c,d,  are  determined 
based  on  the  coordinates  of  the  four  local  nodes  that  define  the  tetrahedral  element.  The  utility  of  these 
element  for  implementing  the  FE-BI  method  for  anisotropic  materials  lies  in  the  fact  that  both  the  basis 
vector  and  its  curl  are  readily  expressed  in  terms  of  Cartesian  quantities.  For  example,  the  curl  of  the 
basis  vector  is  given  by 

vxw,  =-j^l*(cA  -cA>HdA  -‘'AMVj,  -  V  J 

Hence,  the  preferred  basis  function  it  one  that  satisfies  all  the  requirements  of  a  valid  electromagnetic 
field  and  one  convenient  for  implementing  the  required  dot  products. 

Comments  on  Implementing  Tetrahedra 
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Tetrahedra  are  no  more  difficult  in  principle  to  implement  than  are  the  right  prisms.  The  only  twist  is 
that  the  source  of  meshes  could  be  from  a  variety  of  sources.  Consequently,  the  designer  has  less 
control  over  the  mesh  than  in  the  case  of  extruded  meshes.  It  is  therefore  important  to  assure  that  the 
local  node  numbering  meets  any  conventions  required  in  the  code.  For  example,  to  compute  the 
volume  of  the  element,  one  might  use  the  equation 

j  f(*i  - x4  f(y 2  ~  y4 Xz3  “z4 )~  6^3  -  y4  Xz2  -  Z4  )]+] 

V  =-|(yi  —  y4X(Z2  Z4  X^3  ^4  )  (Z3  ~  Z4XX2  “  X4)]+  | 

Kzi -z4X(x2 — x4Xy3 -y4)-(x3 -x4Xy2 -y4)]  J 

This  equation  assumes  a  certain  local  ordering  (e.g.  sense)  to  the  tetrahedral  element.  If  the  opposite 
sense  is  used,  the  computed  volume  is  negative.  To  correct  this,  the  user  can  swap  local  nodes  1  and  2. 

Closing  Remarks 

In  this  paper,  implementation  details  for  three  popular  finite  element  shapes  are  discussed.  Tips  to  the 
developer  are  given  that  will  hopefully  make  the  creation  of  a  hybrid  finite  element-boundary  integral 
computer  program  a  little  less  daunting  to  the  novice  software  designer.  Basis  functions  that  are 
particularly  suitable  for  use  in  evaluating  fields  in  anisotropic  materials  were  presented.  Key 
implementation  hints  will  be  given  at  the  meeting. 
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Abstract 

Hybrid  finite  element  methods  have  been  applied  for  accurate  and  efficient  analysis  of  various 
conformal  antennas.  This  paper  presents  a  series  of  results  that  demonstrate  the  value  of  such 
methods  for  realistic  applications.  Furthermore,  the  potential  of  combining  lowest  and  higher 
order  field  expansions  selectively  is  demonstrated  and  an  example  of  ferrite  antenna  design  using 
sequential  quadratic  programming  is  given. 


Introduction 

Hybrid  finite  element  /  boundary  integral  (FE/BI)  methods  have  been  widely  applied  for  analysis 
of  conformal  antennas  and  arrays  [1,  2].  The  requirements  for  high  degrees  of  accuracy  as  well  as 
modeling  of  volumetric  materials  and  structural  details  associated  with  antenna  geometries  and 
their  feeds  can  be  addressed  by  the  rigor  and  adaptability  of  the  finite  element  method  (FEM). 
Nevertheless,  these  attributes  are  often  not  sufficient  for  the  practical  characterization  of  complex 
antenna  apertures  and  arrays  due  to  the  CPU  and  memory  requirements  associated  with  the  BI 
portion  of  the  hybrid  FE/BI  system.  The  BI  subsystem. dominates  the  CPU  requirements  with  its 
0(Nj)  computations,  where  Ns  refers  to  the  number  of  unknowns  on  the  antenna/ array  aperture. 
However,  recent  developments  of  fast  integral  methods  [3,  4,  5]  and  adaptive  higher  order  or  multi¬ 
scale  elements  [6]  have  allowed  us  to  circumvent  some  of  these  CPU  bottlenecks  and  are  permitting 
the  analysis  of  large  finite  apertures  and  multilayered  arrays.  Most  importantly,  these  speed-ups 
can  lead  to  algorithms  which  can  be  used  for  design  purposes. 

This  paper  will  consider  the  performance  of  three  fast  hybrid  FE/BI  methods  in  the  context  of 
various  antenna  and  array  analysis  examples  :  (a)  Hybrid  FE/BI- AIM,  where  the  adaptive  integral 
method  (AIM)  [3]  is  used  to  speed  up  the  BI  matrix-vector  products,  (b)  Hybrid  FE/BI-FMM, 
where  the  fast  multipole  method  (FMM)  [4]  is  used  to  speed  up  the  BI  matrix-vector  products  and 
(c)  Hybrid  FE/BI-FSDA,  where  the  fast  spectral  domain  algorithm  (FSDA)  [5]  is  used  to  speed 
up  the  BI  matrix-vector  products.  FSDA  is  a  recent  addition  to  the  fast  methods  and  draws  its 
speed-up  by  working  directly  with  the  spectral  form  of  the  BI.  As  a  result,  it  is  particularly  attrac¬ 
tive  for  multilayered  periodic  array  analysis  since  in  that  case  the  spectrum  of  the  Green’s  function 
is  discrete  and  of  finite  bandwidth.  Another  important  difference  between  FSDA  and  AIM/FMM 
is  that  FSDA  provides  considerable  speed-up  even  for  small  values  of  Ns-  The  performance  of 
these  hybrid  methods  will  be  demonstrated  with  their  application  to  the  characterization  of  various 
antenna  problems.  In  addition,  we  will  show  the  advantage  of  using  hierarchical  higher  order  tan¬ 
gential  vector  finite  elements  (TVFEs)  [6]  for  accurate  and  efficient  resonance  frequency  prediction. 
Finally,  we  will  give  an  antenna  design  example  where  sequential  quadratic  programming  (SQP  - 
a  gradient-based  optimization  algorithm)  [7]  is  used  for  optimization. 


Frequency  selective  surface  of  slot-coupled  microstrip  patches 

Frequency  selective  surfaces  (FSSs)  are  of  practical  relevance  for  construction  of  for  instance  po¬ 
larizers,  filters  and  radomes.  In  Fig.  1,  the  geometry  of  the  unit  cell  of  an  FSS  array  consisting 
of  slot-coupled  microstrip  patches  is  depicted.  This  infinite  array  was  investigated  in  [8]  and  acts 
as  a  strongly  resonant  band-pass  structure.  The  incidence  angle  $o  of  the  transverse  electric  (TE) 
polarized  plane  wave  used  to  excite  the  structure  varies  with  frequency  according  to  the  waveguide 
measurement  setup  in  [8].  Fig.  2  compares  the  transmission  coefficients  obtained  by  FE/BI-AIM 
and  FE/BI-FSDA  solutions  to  measured  data  presented  in  [8].  The  agreement  between  the  three 
curves  is  good.  We  note  that  the  FE/BI-FSDA  results  are  closer  to  the  measured  data  than  those 
obtained  by  the  FE/BI-AIM  approach.  With  respect  to  complexity,  the  FE/BI-AIM  solution  used 
98636  volume  and  5308  BI  unknowns  on  each  of  the  top  and  bottom  BI  surfaces.  Also,  foT  the 
FE/BI-AIM  approach,  the  BI  was  implemented  as  a  conventional  spatial  domain  formulation  with 
Ewald  acceleration  of  the  Green’s  function  modal  series  [9,  10]  and  AIM  was  employed  to  speed 
up  the  BI  portion  of  the  code.  The  required  CPU  time  for  the  FE/BI-AIM  implementation  on  a 
Pentium  II  PC/266  MHz  was  between  3  and  4  hours  per  frequency  point.  Using  the  FE/BI-FSDA, 
the  CPU  times  were  considerably  decreased  even  though  the  number  of  unknowns  was  increased  by 
putting  additional  air  layers  between  the  microstrip  patches  and  the  BI  termination  to  reduce  the 
number  of  Floquet  modes  needed.  Specifically,  the  number  of  volume  unknowns  grew  to  a  value 
of  137804  and  the  number  of  BI  unknowns  was  increased  to  14828  in  each  of  the  top  and  bottom 
BI  surfaces.  The  required  CPU  time  on  the  PC  for  FSDA  3+  (Floquet  mode  series  indices  ranging 
from  -3  to  +3)  was  about  50  minutes  per  frequency  point. 


Figure  1:  FSS  unit  cell  of  the  aperture  cou¬ 
pled  microstrip  patches  as  suggested  in  [8]. 
The  parameters  are  er  —  2.2,  d  =  1.6  mm, 
a  =  36.07  mm,  b  =  34.04  mm,  Ws  —  2  mm, 
Ls  =  8  mm,  Wp  =  Lp  =  28  mm. 


Figure  2:  TE  transmission  coefficient  of  the 
FSS  structure  in  Fig.  1  for  different  FE/BI 
models  compared  to  measured  results  from 
[8].  <p  =  0°,  varying  from  57°  to  32°  for  / 
varying  from  2.5  GHz  to  4.0  GHz,  according 
to  waveguide  measurement  setup  in  [8]. 


FE/BI-FMM  analysis  of  cavity-backed  antennas  on  curved  plat¬ 
forms 

In  this  section,  we  present  an  approximate  FE/BI-FMM  solution  to  problems  involving  antennas 
recessed  in  curved  platforms.  For  cylindrical  platforms,  the  closed  form  Green’s  function  can  be 
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used  to  formulate  the  problem  [11].  The  lack  of  a  closed  form  Green’s  function  fox  arbitrary  doubly 
curved  structures  forces  one  to  use  an  approximate  Green’s  function.  In  this  work,  the  half  space 
Green’s  function  is  used  as  if  the  cavity  resides  in  a  flat  ground  plane.  It  is  demonstrated  that 
this  approach  is  applicable  for  geometries  which  are  not  highly  curved.  The  FMM  [4]  is  used  in 
the  BI  formulation  to  speed  up  the  evaluation  of  matrix-vector  products  and  lower  the  memory 
requirements. 


5x1  patch  antenna  arrays  on  flat  and  curved  platforms  are  examined.  The  effect  of  curvature  on 
the  radiation  pattern  and  input  impedance  of  the  antenna  will  be  predicted  using  the  FE/BI-FMM 
approach.  For  the  flat  arrays,  each  antenna  element  is  a  square  patch  of  size  3.5  cm  x  2.625  cm. 
We  consider  the  two  different  center- to- center  spacings  6.125  cm  and  5.250  cm.  The  flat  arrays  are 
backed  by  rectangular  cavities  of  dimensions  33.25  cm  x  6.125  cm  X  0.3175  cm  and  28.00  cm  X 
6.125  cm  x  0.3175  cm,  respectively,  and  recessed  in  an  infinite  metallic  ground  plane.  The  cavities 
are  filled  with  a  material  of  relative  permittivity  2.32.  Curved  arrays  are  formed  from  the  flat  arrays 
by  wrapping  them  onto  metallic  circular  cylindrical  platforms  of  radius  20  cm  (spacing  6.125  cm) 
and  10  cm  (spacing  5.250  cm).  For  all  arrays,  the  five  patches  are  fed  in-phase  by  an  offset  vertical 
probe  of  constant  current.  The  surface  mesh  (discretization  into  squares  of  dimensions  0.4375  cm 
x  0.4375  cm  and  subsequent  division  into  triangles)  is  given  in  Fig.  3  for  the  curved  array  with 
spacing  6.125  cm. 


The  effects  of  curvature  on  the  radiation  pattern  are  demonstrated  in  Fig.  4  where  for  the 
spacing  6.125  cm  the  radiation  patterns  for  flat  and  curved  platforms  are  compared.  The  side  lobes 
for  the  flat  array  are  seen  to  be  eliminated  when  the  array  is  curved.  Fig.  5-8  show  the  dependence 
on  curvature  of  the  input  impedance  of  the  first  and  second  array  element  for  flat  and  curved  array 
geometries.  This  is  shown  for  the  two  different  element  spacings  6.125  cm  and  5.250  cm.  The  effect 
of  coupling  is  observed  for  the  input  impedance  of  the  second  element  when  the  spacing  between 
the  elements  is  5.250  cm. 


Figure  3:  Illustration  of  5  x  1  con-  Figure  4:  Radiation  pattern  of  flat  (left)  and  curved 

formal  array  (right)  array  for  spacing  6.125  cm. 


Resonant  frequency  determination  using  selective  field  expansion 

Consider  a  square  metallic  patch  antenna  backed  by  a  rectangular  cavity  recessed  in  an  infinite 
metallic  ground  plane,  as  illustrated  in  Fig.  9.  The  cavity-backed  patch  antenna  is  situated  in  free 
space  characterized  by  the  permittivity  e0  and  the  permeability  fi0.  The  cavity  is  of  dimensions  1.85 
cm  x  1.85  cm  x  0.15  cm  and  filled  with  a  dielectric  material  of  permittivity  10  £0  and  conductivity 
0.0003  S/cm.  The  patch  is  of  side  length  0.925  cm  and  centered  in  the  cavity  aperture.  It  is  fed 
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Figure  5:  Input  impedance  of  the  1st  element 
of  flat  and  curved  (20  cm  cylinder)  array  with 
spacing  6.125cm. 


Figure  7 :  Input  impedance  of  the  1st  element 
of  fiat  and  curved  (10  cm  cylinder)  array  with 
spacing  5.250cm. 


Figure  6:  Input  impedance  of  the  2nd  ele¬ 
ment  of  flat  and  curved  (20  cm  cylinder)  ar¬ 
ray  with  spacing  6.125cm. 


Figure  8:  Input  impedance  of  the  2nd  ele¬ 
ment  of  flat  and  curved  (10  cm  cylinder)  ar¬ 
ray  with  spacing  5.250cm. 


by  a  vertical  coaxial  line  whose  outer  conductor  is  attached  to  the  ground  plane  and  whose  inner 
conductor  is  attached  to  the  patch  at  the  mid  point  of  an  edge,  as  illustrated  in  Fig.  9.  The  coaxial 
feed  is  modeled  as  a  vertical  probe  of  constant  current. 

An  almost  identical  antenna  was  considered  by  Schuster  and  Luebbers  [12].  In  [12],  the  cavity 
walls  and  the  ground  plane  were  removed  and  a  similar  patch  on  a  similar  but  finite  grounded 
dielectric  substrate  was  analyzed  using  the  finite  difference  time  domain  method.  In  spite  of  these 
geometrical  differences,  the  two  antennas  are  expected  to  have  nearly  the  same  input  impedance 
and,  consequently,  nearly  the  same  resonant  frequency  since  the  dominant  fields  are  confined  to  a 
volume  under  and  in  the  near  vicinity  of  the  patch.  The  resonant  frequency  was  estimated  in  [12] 
to  be  4.43  GHz.  The  resistance  at  resonance  was  found  to  be  400  ft  while  the  reactance  was  in 
the  range  of  230  ft  to  -170  ft  close  to  resonance.  All  results  in  [12]  were  found  with  a  very  fine 
discretization  and  can  be  considered  accurate. 
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Figure  9:  Square  metallic  patch  antenna  backed  by  a  dielectric-filled  rectangular  cavity  recessed  in 
an  infinite  metallic  ground  plane. 


Two  different  TVFE  options  are  applied.  The  first  TVFE  option  is  to  use  the  mixed-order 
TVFE  of  order  0.5  [6]  throughout  the  mesh.  For  a  mesh  of  average  edge  length  0.260  cm  (Case  1), 
the  input  impedance  is  determined  as  a  function  of  frequency  and  the  resonant  frequency  of  the 
patch  is  predicted.  The  coarse  discretization  of  Case  1  means  that  this  resonant  frequency  is  most 
likely  not  accurate.  For  meshes  of  average  edge  lengths  of  0.188  cm  (Case  2),  0.153  cm  (Case  3) 
and  0.133  cm  (Case  4),  more  accurate  resonant  frequencies  but  also  higher  computational  costs  can 
be  expected.  The  second  TVFE  option  is  to  use  the  mixed-order  TVFE  of  order  1.5  [6]  close  to  the 
radiating  edges  and  the  mixed-order  TVFE  of  order  0.5  elsewhere.  For  the  meshes  of  average  edge 
length  0.260  cm  (Case  5)  and  0.188  cm  (Case  6),  the  input  impedance  is  again  determined  and  the 
resonant  frequency  is  again  predicted.  The  effectiveness  of  this  approach  (Case  5-6)  in  terms  of 
accuracy  /  CPU  time  /  memory  requirements  is  compared  to  the  previous  one  (Case  1-4).  The  six 
cases  are  summarized  in  Tab.  1. 

Real  and  imaginary  parts  of  the  input  impedance  as  a  function  frequency  are  given  in  Fig. 
10  -  11  for  Case  1-6  and  corresponding  resonant  frequencies  are  provided  in  Tab.  1.  For  Case  1-4,  a 
larger  and  larger  resonant  frequency  is  observed  as  the  mesh  becomes  denser  and  denser.  However, 
even  for  Case  4,  the  error  as  compared  to  the  result  obtained  by  Schuster  and  Luebbers  is  quite 
large  (2.98  %)  for  resonant  frequency  computation.  Use  of  selective  field  expansion  (Case  5-6)  leads 
to  a  significant  accuracy  improvement.  Case  5  (error  2.42  %)  gives  a  more  accurate  result  than  Case 
1-4  and  Case  6  (error  0.16  %)  matches  the  result  by  Schuster  and  Luebbers  almost  exactly.  The 
computational  costs  (number  of  unknowns,  number  of  BI  unknowns,  number  of  non-zero  matrix 


Case 

TVFE 

order(s) 

Average 

edge 

length 

[cm] 

Resonant 

frequency 

[GHz] 

#  of 

unknowns 

#  of  BI 
unknowns 

#  of 
matrix 
entries 

Time  per 
frequency 
point 
[sec] 

1 

0.5 

0.260 

3.974 

345 

120 

17119 

7.52 

2 

0.5 

0.188  : 

4.147 

817 

288 

89695 

44.78 

3 

0.5 

0.153  i 

4.258 

1489 

528 

291359 

222.92 

4 

0.5 

0.133  | 

4.302 

2361 

840 

725791 

771.59 

5 

0.5/1.5 

0.260 

4.323 

827 

120 

30675 

17.33 

6 

0.5/1.5 

0.188 

4.437 

1467 

288 

107963 

77.28 

Table  1:  Computational  costs  for  Case  1-6  for  antenna  in  Fig.  9. 
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Figure  10:  Real  part  of  the  input  impedance  of  the  antenna  in  Fig.  9  for  Case  1-6. 


Figure  11:  Imaginary  part  of  the  input  impedance  of  the  antenna  in  Fig.  9  for  Case  1-6. 


entries  (memory  usage)  and  CPU  time  per  frequency  point)  to  obtain  these  results  are  also  given  in 
Tab.  1.  It  is  evident  that  the  second  TVFE  option  corresponding  to  Case  5-6  is  significantly  more 
attractive  than  the  first  TVFE  option  corresponding  to  Case  1-4.  Case  5  gives  a  more  accurate 
result  than  Case  4  but  uses  only  4.22  %  of  the  memory  and  2.15  %  of  the  CPU  time  that  Case  4 
does.  The  accuracy  of  Case  6  is  vastly  superior  to  that  of  Case  4  and  yet  Case  6  uses  only  14.88  % 
of  the  memory  and  10.02  %  of  the  CPU  time  that  Case  4  does. 


Resonant  frequency  tracking  for  ferrite  antennas 

Recently,  the  non-reciprocal  properties  of  ferrites  have  been  used  to  control  printed  circuits  and 
antennas.  The  material  properties  of  ferrites  are  mainly  controlled  by  the  direction  and  strength 
of  an  externally  applied  magnetic  bias  field,  thus  enabling  control  of  various  antenna  radiation 
and  scattering  characteristics.  These  unique  properties  of  ferrite  materials  provide  many  desirable 
features  such  as  tunability,  polarization  diversity,  beam  steering  and  radar  cross  section  control. 
Furthermore,  because  of  the  additional  variables  such  as  bias  field  strength,  bias  direction  and 
saturation  magnetization,  optimization  methods  are  very  suitable  for  ferrite  antennas. 

Resonant  frequency  tracking  and  tuning  are  important  for  ferrite  patch  antennas.  Instead  of 
locating  the  resonance  by  frequency  sweep,  we  apply  SQP  in  conjunction  with  a  FE/BI  simulation 
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[7]  to  develop  a  set  of  design  curves.  The  optimization  problem  can  be  formulated  as  follows: 

Find  /»  which  minimizes  jImag(Z,n(/))|  subject  to  fi  <  /  <  fu 

where  Z,„  is  the  frequency-dependent  input  impedance  and  fi  and  fu  are  lower  and  upper  bounds 
for  the  frequency.  We  use  the  definition  that  the  antenna  reaches  resonancy  when  the  imaginary 
part  of  the  input  impedance  is  zero. 

As  an  example,  we  study  an  antenna  similar  to  the  one  in  [13],  with  L  =  W  —  0.61  cm, 
h  =  0.127  cm,  and  eT  =  15.  We  place  the  antenna  in  a  1.22  cm  x  1.22  cm  cavity  and  a  probe  feed  is 
located  at  the  mid  point  of  an  edge.  The  geometry  is  illustrated  in  Fig.  12.  For  the  isotropic  case, 
the  resonance  should  occur  around  5.5  GHz.  The  antenna  is  biased  parallel  to  one  of  the  edges  of 
the  patch  instead  of  normal  to  the  patch  as  in  [13].  We  solve  the  optimization  problem  for  various 
values  of  the  bias  field  strength  H0  and  saturation  magnetization  A-kMs.  The  results  are  plotted 
in  Fig.'  13-14  for  x-bias  (bias  field  perpendicular  to  the  edge  where  the  feed  is  located)  and  y-bias 
(bias  field  parallel  to  the  edge  where  the  feed  is  located),  respectively.  It  is  seen  that  the  resonant 
frequency  of  the  antenna  shifts  to  a  higher  frequency  as  Hq  or  4 irMs  increases.  Also,  the  resonant 
frequencies  for  y-bias  are  higher  than  those  for  x-bias.  This  provides  more  tuning  flexibility.  Other 
ferrite  parameters  may  also  play  a  role  in  resonant  frequency  tuning. 

e0  Ho 

0.305  cm  0.61  cm  0.305  cm 
I  0.127cm  15£0  JJ.0  r 


Figure  12:  Illustration  of  cavity-backed  patch  antenna. 


Figure  13:  Resonant  frequency  for  cavity- 
backed  patch  antenna  in  Fig.  12  as  a  function 
of  the  bias  field  and  saturation  magnetiza¬ 
tion  (x-bias). 


Figure  14:  Resonant  frequency  for  cavity- 
backed  patch  antenna  in  Fig.  12  as  a  function 
of  the  bias  field  and  saturation  magnetiza¬ 
tion  (y-bias). 
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Conclusions 

This  paper  presented  a  series  of  results  that  demonstrate  the  capabilities  of  fast  hybrid  FE/BI  meth¬ 
ods  for  accurate  and  efficient  analysis  of  various  conformal  antennas.  FE/BI- AIM,  FE/BI-FMM  as 
well  as  FE/BI-FSDA  was  considered.  Furthermore,  the  potential  of  selective  field  expansion  using 
hierarchical  higher  order  TVFEs  was  demonstrated  and  an  example  of  ferrite  antenna  design  using 
SQP  was  provided. 
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1  Introduction 

The  symmetry  present  in  body  of  revolution  (BOR)  electromagnetic  scattering  problems  permits  an 
efficient  numerical  solution  using  a  two-dimensional  (2-D)  technique  [l]-[5].  However,  in  many  practical 
problems,  the  rotational  symmetry  is  broken  by  the  presence  of  small  appendages  (see  Fig.  1).  Thus, 
a  three-dimensional  (3-D)  computational  method  is  required  to  rigorously  compute  the  electromagnetic 
scattering.  Because  of  the  increased  computational  complexity  of  a  3-D  method,  a  hybrid  method  is 
developed  which  allows  the  scattering  from  small  appendages  to  be  approximately  combined  with  the 
scattering  from  a  large  BOR  in  a  similar  manner  to  other  hybrid  techniques  [6]— [9] .  The  rotational 
symmetry  can  then  be  exploited  in  the  computation  of  the  scattering  from  the  large  BOR.  The  hybrid 
method  makes  use  of  the  finite  element  method  (FEM)  for  BOR  scattering  as  described  in  [1],  and  the 
method  of  moments  (MoM)  as  described  in  [8].  The  remainder  of  this  paper  discusses  the  hybridization 
of  the  two  methods.  The  formulation  is  presented  in  Section  2,  numerical  results  are  given  in  Section  3, 
and  concluding  remarks  are  found  in  Section  4. 


2  Formulation 

The  hybrid  formulation  is  best  understood  by  first  considering  the  computation  of  the  scattering  from 
a  BOR  with  appendages  (see  Fig.  1)  entirely  by  the  MoM.  Assuming  that  the  entire  target  consists  of 
perfect  conductors,  the  application  of  the  MoM  uses  the  integral  equation 

E(r)  =  E1  (r)  -  jkor}o  jj  G0  (r,  r')  •  J (v')dS'  (1) 

s 

where  ko  =  is  the  free-space  wavenumber,  770  =  vVo/eo  is  the  impedance  of  free  space,  Go  is 

the  free-space  dyadic  Green’s  function,  E*  is  a  known  incident  electric  field,  J  is  the  unknown  current  on 
the  surface  of  the  target,  and  S  is  the  surface  of  the  entire  target.  When  using  the  MoM,  J  is  found  by 
discretizing  the  surface  S  and  applying  the  appropriate  boundary  condition  on  the  discretized  surface. 


Figure  1:  Example  of  a  large  BOR  with  small  appendages. 
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The  scattered  field  is  then  found  from 


Es(r)  =  -jkovo  J f  G„(r,  r')  -  J(r ')dS'.  (2) 

s 

This  3-D  method  generates  a  dense  matrix  equation;  therefore,  if  the  geometry  is  large,  the  method  is  very 
computationally  intensive.  Further,  if  the  BOR  contains  inhomogeneous  materials,  the  surface  integral 
in  Eq.  (1)  must  be  replaced  with  a  volume  integral,  further  increasing  the  computational  complexity. 

Because  of  the  unfavorable  computational  complexity  in  solving  Eq.  (1)  with  the  MoM,  an  alternate, 
hybrid  method  is  sought.  In  the  hybrid  method,  the  MoM  is  applied  to  the  small  appendages  only.  Thus, 
Eq.  (1)  becomes 


E(r)  =  Eg0R,(r)  -  jkoVQ  J f  GB0R(r,  r')  •  J(r')dS"  (3) 

Sapp 

where  EgOR  represents  the  incident  field  on  the  appendages  in  the  presence  of  the  BOR,  5app  represents 
the  surface  of  the  appendages,  and  GBor  represents  the  dyadic  Green’s  function  in  the  presence  of  the 
BOR.  The  solution  of  Eq.  (3)  requires  the  discretization  of  Sapp  rather  than  S  and  thus  is  much  less 
computationally  intensive.  The  incident  field  Eg0R  is  calculated  using  the  FEM  [1],  but  difficulty  is  still 
encountered  in  solving  Eq.  (3)  with  the  MoM  because  the  Green’s  function  Gbor  is,  in  general,  unknown. 
The  unknown  Green’s  function  is  needed  not  only  for  the  solution  of  Eq.  (3),  but  also  to  compute  the 
scattered  electric  field  from 


Es(r)  =  -jk0r)0  J f  GB0R(r,  r')  ■  J(r')dS".  (4) 

^App 

The  difficulty  posed  by  the  unknown  Green’s  function  GBor  is  alleviated  in  two  ways.  An  approximate 
Green’s  function  is  used  for  the  solution  of  Eq.  (3),  and  an  alternate  method  based  on  the  reciprocity 
theorem  is  used  to  compute  Es. 

There  are  four  steps  to  using  the  hybrid  method.  First,  the  FEM  described  in  [1]  is  used  to  compute 
scattering  from  the  large  BOR  alone.  Then,  the  result  of  the  first  step  is  used  to  compute  Eg0R,  the 
incident  field  on  the  appendages  in  the  presence  of  the  BOR.  Next,  the  MoM  with  an  approximate 
Green’s  function  is  used  to  solve  Eq.  (3)  for  J,  the  current  on  the  appendages.  Finally,  the  scattered  field 
generated  by  J  is  evaluated  using  reciprocity  and  added  to  the  scattered  field  from  the  BOR,  which  is 
found  in  the  first  step.  The  result  is  an  approximation  to  scattered  field  from  the  entire  structure.  In  the 
remainder  of  this  section,  the  construction  of  the  approximate  Green’s  function  and  the  computation  of 
the  scattered  field  by  reciprocity  are  each  discussed  in  turn. 

2.1  Approximate  Green’s  function 

The  approximate  Green’s  function  used  to  solve  Eq.  (3)  is  developed  by  approximating  the  BOR  on  which 
the  appendages  reside  as  a  long  cylinder.  The  approximate  Green’s  function  must  model  the  significant 
field  interactions  between  points  on  the  appendages  but  may  neglect  other,  less  significant  interactions. 
The  interactions  are  classified  into  four  types.  The  first  type  of  interaction  is  the  direct  path  interaction 
between  two  points,  and  examples  of  this  type  of  interaction  are  illustrated  in  Fig.  2a.  The  second  type 
of  interaction  involves  a  reflection  by  the  large  BOR,  and  examples  are  illustrated  in  Fig.  2b.  The  third 
interaction  type  is  the  surface  wave  interaction  which  is  illustrated  in  Fig.  2c.  All  other  interactions 
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are  classified  as  the  fourth  type.  Most  of  these  are  complex  interactions  which  are  not  modeled  by  the 
cylinder  approximation  to  the  BOR,  and  most  are  negligible. 

If  the  line  of  sight  between  two  points  on  the  appendages  is  unobstructed  by  the  cylinder  approxi¬ 
mation  to  the  large  BOR,  the  first  two  interaction  types  can  be  computed  using  the  half-space  Green’s 
function.  This  is  done  by  finding  the  reflection  point  on  the  cylinder  and  computing  the  tangent  plane 
to  the  cylinder  at  that  point.  If  the  line  of  sight  between  the  two  appendage  points  is  obstructed  by 
the  cylinder,  both  of  the  first  two  interactions  are  zero.  For  many  problems,  modeling  the  first  two 
interactions  produces  acceptable  results.  If  more  accuracy  is  desired,  the  third  type  of  interaction  can 
be  computed  using  the  geometrical  theory  of  diffraction  (GTD)  [10].  Also,  if  the  appendages  are  near 
the  end  of  the  BOR,  it  may  be  necessary  to  compute  an  interaction  based  on  edge  diffraction  from  the 
nearby  end.  Note  that  the  effect  of  neglecting  some  of  the  interactions  is  to  neglect  the  corresponding 
field  which  is  scattered  by  the  appendages,  reflected  or  diffracted  back  to  the  appendages,  and  scattered 
by  the  appendages  again  [6]— [9]. 

2.2  Computation  of  scattered  field 

While  the  use  of  the  approximate  Green’s  function  allows  Eq.  (3)  to  be  solved  for  J,  the  approximation 
is  not  accurate  when  computing  E,s  from  Eq.  (4).  Therefore,  an  alternate  method  of  computing  Es  is 
used.  The  alternate  method  is  based  on  the  reciprocity  theorem.  Consider  a  short  dipole,  located  at 
point  r  and  oriented  in  the  u  direction.  From  the  reciprocity  theorem, 

p-jfcor  r  r 

E*(r)  ■  u  =  -jkovo-^r  j J  EWrO  •  J(r')<*S'  (5) 

$App 

where  ER0R  is  the  field  radiated  by  the  dipole  in  the  presence  of  the  large  BOR.  Recall  that  this  field 
can  be  computed  by  the  FEM.  In  fact,  when  backscattering  is  being  computed,  EROR  is  the  same  as 
EgQR  that  is  used  in  Eq.  (3).  Thus,  all  components  needed  to  compute  Es  using  Eq.  (5)  are  known. 

3  Numerical  Results 

Several  numerical  results  are  presented  to  show  the  validity  and  capability  of  the  hybrid  technique.  In 
all  of  the  results  presented,  the  direct  and  reflected  interactions  are  modeled  by  the  approximate  Green’s 
function,  and  all  other  interactions  are  neglected. 

First,  the  validity  of  the  technique  is  tested  by  computing  the  scattering  from  metallic  cylinders  with 
one,  two,  or  four  wings.  The  scattering  is  compared  with  computations  from  the  Fast  Illinois  Solver 
Code  (FISC)  [11],  which  is  an  MoM  program  that  uses  a  multilevel  fast  multipole  algorithm  to  speed  up 
the  matrix  solution.  The  cylinders  considered  have  a  radius  of  1.25A  and  height  of  5A,  where  A  is  the 
free-space  electromagnetic  wavelength.  The  attached  wings  are  1A  by  0.5A  by  0.025A. 

The  monostatic  RCS  as  a  function  of  azimuth  angle  from  the  cylinder  with  one  wing  is  shown  in 
Fig.  3.  The  agreement  between  FISC  and  the  hybrid  method  is  very  good.  The  scattering  from  the 
cylinder  without  the  wing  is  flat  as  a  function  of  azimuth  angle,  so  it  can  be  seen  that  the  wing  has 
added  approximately  5  dB  peak-to-peak  swing  to  the  RCS  in  the  VV-polarized  case  and  about  4  dB 
peak-to-peak  swing  in  the  HH-polarized  case.  Although  adding  a  second  wing  presents  the  possibility  of 
more  complex  interactions,  the  results  shown  in  Fig.  4  continue  to  show  good  agreement  between  FISC 
and  the  hybrid  method.  The  peak-to-peak  variation  in  the  RCS  is  about  4.5  dB  in  the  VV-polarized 
case  and  almost  6  dB  in  the  HH-polarized  case  when  two  wings  are  present.  The  scattering  from  the 
cylinder  with  four  wings  is  shown  in  Fig.  5,  where  the  agreement  between  FISC  and  the  hybrid  method 
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(a)  W-pol 


(b)  HH-pol 


Figure  3:  RCS  of  a  metallic  cylinder  with  a  wing.  The  cylinder  has  a  radius  of  1.25A  and  height  of  5A, 
and  the  wing  is  1A  by  0.5A  by  0.025A. 


(a)  VV-pol  (b)  HH-pol 

Figure  4:  RCS  of  a  metallic  cylinder  with  two  wings.  The  cylinder  has  a  radius  of  1.25A  and  height  of 
5A,  and  the  wings  are  1A  by  0.5A  by  0.025A. 


remains  very  good.  With  four  wings  present,  the  peak-to-peak  swing  in  the  RCS  is  about  5  dB  in  the 
VV-polarized  case  and  about  6  dB  in  the  HH-polarized  case. 

To  further  illustrate  the  capability  of  the  hybrid  method,  two  more  computed  results  are  presented. 
Both  of  these  results  involve  1-GHz  scattering  from  a  missile  with  appendages.  The  missile  is  12.5  m 
(41.7A)  long  and  has  a  radius  of  0.625  m  (2.1A).  For  the  first  case,  a  3-cm  (0.1A)  by  3-cm  (0.1A)  by 
8.125-m  (27.7A)  ridge  is  located  on  the  missile  at  azimuth  angle  0°,  and  computed  results  for  an  azimuth 
scan  and  for  an  elevation  scan  are  presented  in  Fig.  6.  Note  that  in  the  azimuth  scan,  the  ridge  causes  the 
scattering  to  vary  over  a  3.5-dB  range  in  the  W-polarized  case  and  over  a  3-dB  range  in  the  HH-polarized 
case  while  the  missile  alone,  because  of  its  rotational  symmetry,  has  a  constant  RCS  as  a  function  of 
azimuth  angle.  In  the  second  case,  two  0.375-m  (1.25A)  by  1-m  (3.3A)  fins  are  located  on  the  missile  at 
azimuth  angles  90°  and  —90°.  The  scattering  is  shown  both  for  an  azimuth  cut  and  for  an  elevation  cut 
in  Fig.  7.  In  the  azimuth  cut,  the  scattering  has  changed  from  flat  for  the  missile  alone  to  a  function 
with  a  3  dB  peak-to-peak  variation  in  the  W-polarized  case  and  a  2  dB  peak-to-peak  variation  in  the 
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(a)  VV-pol 


(b)  HH-pol 


Figure  5:  RCS  of  a  metallic  cylinder  with  four  wings.  The  cylinder  has  a  radius  of  X.25A  and  height  of 
5A,  and  the  wings  are  1A  by  0.5A  by  0.025A. 


(a)  Azimuth  cut  (b)  Elevation  cut 

Figure  6:  RCS  of  a  missile  with  a  ridge  at  1  GHz.  The  missile  is  12.5  m  (41.7A)  long  and  has  a  radius 
of  0.625  m  (2.1A).  The  ridge  is  3  cm  (0.1A)  by  3  cm  (0.1A)  by  8.125  m  (27.7A)  and  located  at  azimuth 
angle  0°. 
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Figure  7:  RCS  of  a  missile  with  two  fins  at  1  GHz.  The  missile  is  12.5  m  (41. 7A)  long  and  has  a  radius 
of  0.625  m  (2.1A).  The  fins  are  0.375  m  (1.25A)  by  1  m  (3.3A)  and  located  at  azimuth  angles  90°  and 
-90°. 

HH-polarized  case. 

4  Summary 

The  hybrid  method  is  a  useful  extension  of  the  FEM/BOR  capability  described  in  [1].  While  rigorous 
computation  of  scattering  from  a  BOR  with  appendages  requires  a  3-D  computational  method,  the  hybrid 
method  separates  the  BOR  part  of  the  problem  from  the  appendages.  The  rotational  symmetry  of  the 
BOR  part  of  the  problem  can  then  be  exploited  for  computational  efficiency  while  only  the  appendage 
part,  which  is  typically  much  smaller  than  the  BOR  part,  requires  a  3-D  method.  Numerical  results 
show  the  impact  of  the  appendages  on  the  scattering  from  the  entire  structure  and  verify  the  validity 
and  capability  of  the  hybrid  method. 
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I.  Introduction 

The  Complementary  Operators  Method  (COM)  has  been  successfully  used  for  open- 
region  mesh  truncation  in  the  Finite- Difference  Time-Domain  (FDTD)  solution  of 
wave  propagation  problems  [l]-[3].  The  underlying  mechanism  of  COM  is  two  aux¬ 
iliary  differential  operators,  dx  and  dt.  These  operators  are  applied  on  an  absorbing 
boundary  condition  (ABC)  such  as  Higdon,  Liao, ...etc..  The  purpose  of  these  two 
auxiliary  operators  is  to  produce  reflection  coefficients  that  are  180°  out  of  phase, 
not  only  in  the  analytic  domain,  but  also  in  the  discrete  domain.  By  averaging  the 
solutions  obtained  from  the  application  of  each  of  the  two  operators  on  an  ABC,  we 
arrive  at  a  new  solution  that  is  devoid  of  first-order  reflections. 

In  [lj-[3],  the  COM  theory  has  been  fully  developed  for  time-domain  simulation. 
In  the  frequency  domain,  a  difficulty  arises  when  obtaining  the  corresponding  dual 
of  the  two  auxiliary  operators  dx  and  dt-  This  difficulty  becomes  more  apparent 
when  performing  the  conversion  dt  —+  jiv.  In  this  work,  we  overcome  this  difficulty 
by  introducing  a  new  averaging  operator  which  is  analogous  to  dt  and  achieves  its 
objective  in  the  time  domain. 


II.  Frequency- Domain  Complementary  Operators 

We  consider  the  problem  of  wave  propagation  in  two-dimensional  space  (the  devel¬ 
opment  here  is  equally  applicable  to  the  three-dimensional  space).  Let  us  consider 
a  planar  outer  boundary  parallel  to  the  y-axis  at  x  =  0.  Let  B  deonte  an  ABC. 
Applying  B  on  the  field  u,  we  have: 


Bu  —  0 


(1) 
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The  first  of  the  two  complementary  operators  is  constructed  by  applying  dx  on  B: 


(dxB)u  =  0  (2) 

The  corresponding  reflection  coefficient  can  be  obtained  by  applying  dxB  on  a  plane 
wave  u  —  e~ikx  +  e*kx,  giving: 


R(dxB)  =  (-1  )R(B)  (3) 

In  the  discrete  domain,  however,  the  reflection  coefficient  due  to  dxB  is  affected  by 
the  discretization  of  the  domain  [2].  Therefore,  to  account  for  this  discretization,  we 
represent  the  field  as 


u  =  e~jkxiAx  +  ejkxiAx  (4) 

where  i  is  the  space  index  in  the  x-direction.  Approximating  the  partial  derivative  in 
(2)  as  a  backward  finite  difference,  we  have 


dx 


Dx  = 


( I-S -1) 

Ax 


(5) 


where  I  is  the  identity  operator  and  S  1  is  the  shift  operator.  Finally  substituting 
(5)  into  (3)  and  applying  the  resulting  operator  on  (4),  we  have 


R{8XB)  ->  R{DXB )  =  (-1  )ejkxAxRd{B)  (6) 


where  R?(B)  denoted  the  discrete  correspondence  of  R(B). 

For  the  complementary  operation  to  be  exact,  we  need  a  second  complementary 
operator  that  gives  a  reflection  coefficient  of  (— 1)R(DXB).  This  can  be  accomplished 
by  defining  a  new  discrete- domain  operator,  which  we  denote  by  Dx: 


Dx  = 


i+s-1 

Ax 


(7) 


Finally,  applying  JDX  on  B,  we  have 


R{DXB)  =  eikxAxRd{B)  (8) 

Equation  (8)  and  (6)  are  precisely  180°  out  of  phase;  hence,  full  complementariness 
is  achieved. 
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III.  Numerical  Experiment 


The  validity  of  the  development  above  is  demonstrated  by  a  numerical  experiment 
in  which  we  study  the  problem  of  scattering  by  a  1.95A  x  1.95A  perfectly  conducting 
square  cylinder.  The  outer  boundary  is  positioned  such  that  the  separation  between 
it  and  the  conductor  is  0.35A  as  shown  in  Fig.  1.  Figure  2  shows  the  magnitude  of  the 
electric  field  on  the  observation  contour  T  (see  Fig.  2)  as  calculated  using  the  FEM 
solution  A  total  of  164  nodes  span  the  observation  contour.  The  numbering  of  the 
nodes  starts  at  the  lower  left-hand  corner  and  proceeds  clockwise.  Results  axe  only 
shown  for  field  values  on  the  upper  half  of  the  contour  due  to  the  symmetry  of  the 
problem.  For  comparison,  the  Method  of  Moments  (MoM)  solution  is  also  provided 
for  this  problem.  The  FEM  solutions  were  obtained  using  Bayliss-Turkel  second-order 
operator  (B  =  (dn+jk)2),  Bayliss-Turkel  fourth-order  operator  ( B  =  (dn+jk)  ),  and 
the  fourth-order  COM  solution  (B  =  (dn+jk)*);  n  =  x,y.  (The  FEM  implementation 
for  the  COM  operators  follows  [4].)  The  numerical  results  show  very  strong  agreement 
between  the  COM  and  MoM  solutions. 


IV.  Conclusion 

This  paper  presented  the  development  of  frequency- domain  complementary  oper¬ 
ators.  The  application  requires  two  independent  solutions  of  the  problem.  Despite 
this,  however,  computer  memory  and  execution  time  can  be  saved  by  positioning  the 
outer  mesh-truncating  boundary  closer  to  the  conductor  than  what  would  be  required 
when  using  other  techniques. 
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outer  boundary 


Fig.  2.  Computational  domain  used  for  the  problem  of  scattering 
by  a  perfectly  conducting  square  cylinder. 


node  number 


Fig.  2.  Electric  field  for  the  problem  of  scattering  by  a  perfectly 
conducting  square  cylinder  shown  in  Fig.  1. 


270 


SESSION  5 


MODEL-BASED 

PARAMETER 

ESTIMATION 


Chairs:  Andreas  Cangellaris  and  Edmund  Miller 


271 


Model-Based  Frequency  Extrapolation  of  Antenna  Radiation 
Characteristics  on  Complex  Platforms 

Tao  Su,  Yuanxun  Wang  and  Hao  Ling 
Department  of  Electrical  and  Computer  Engineering 
The  University  of  Texas  at  Austin 
Austin,  TX  78712 


I.  Introduction 

In  antenna  design  and  analysis,  the  mounting  platform  can  have  a  significant  effect  on  the 
antenna  radiation  characteristics.  However,  rigorous  solution  of  the  radiation  problem  over  a  complex 
platform  is  very  time  consuming,  and  the  computation  complexity  increases  dramatically  as  the 
frequency  increases.  In  this  paper,  we  present  a  model-based  frequency  extrapolation  technique  with 
which  the  radiated  field  over  a  broad  band  of  frequencies  can  be  obtained  using  the  rigorously 
computed  results  at  low  frequencies. 

Our  approach  entails  three  steps.  First,  the  induced  current  on  the  platform  surface  is  computed 
at  low  frequencies  using  the  method  of  moments  (MoM).  Second,  we  apply  the  time-of-amval  model 
to  the  current  on  each  basis  element  on  the  surface.  The  model  coefficients  are  obtained  using  super¬ 
resolution  algorithm  ESPRIT  [1].  Finally,  the  induced  current  at  higher  frequencies  is  computed  using 
the  model  and  the  radiation  characteristics  are  calculated.  This  approach  is  similar  to  that  present  in  [2] 
and  [3]  for  radar  signature  extrapolation.  Our  results  show  that  when  the  frequencies  and  the 
discretization  of  the  platform  are  properly  chosen,  the  radiated  field  at  higher  frequencies  can  be 
extrapolated  with  only  moderate  computational  cost. 


II.  Extrapolation  Methodology 

As  the  first  step  of  the  extrapolation  process,  the  induced  current  on  the  target  is  computed  at  a 
small  set  of  points  at  low  frequencies.  Once  the  current  over  the  surface  is  computed  at  low 
frequencies,  we  apply  the  time-of-arrival  model  to  each  of  the  current  element.  In  this  model,  we 
assume  that  the  total  current  is  induced  by  different  scattering  mechanisms,  as  shown  in  Fig.  1.  Each 
of  the  incident  mechanism  has  a  distinct  arrival  time,  so  that  the  current  can  be  written  as 

J{C0)  =  ^ane~ia"  O) 

n= I 

where  (o  is  the  angular  frequency,  N  is  the  total  number  of  incident  mechanisms  and  tn  is  the  arrival 
time  of  the  n*  incident  mechanism. 

Since  the  incident  mechanisms  correspond  to  scattering  from  different  parts  of  the  target,  the 
maximum  difference  in  t„  is  related  to  the  size  of  the  target.  Thus  the  sampling  rate  in  the  frequency 
domain  should  be  high  enough  to  distinguish  these  time  events  from  all  parts  of  the  target.  Based  on 
this  consideration,  we  approximately  constraint  the  sampling  rate  in  frequency  to 
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(2) 


Af  <c/D 

where  c  is  the  speed  of  light  and  D  is  the  maximum  dimension  of  the  target. 

The  model  coefficients  an  and  t„  are  obtained  using  the  superresolution  algorithm  ESPRIT, 
which  is  based  on  the  model  that  the  signal  consists  of  a  sum  of  exponential  and  additive  white  noise. 
Given  a  sequence  with  M  samples,  the  algorithm  can  estimate  the  number  of  exponential  N  and 
determines  the  amplitude  and  period  of  each  exponential  term.  The  basic  requirement  in  the  number  of 
samples  is  M  >  2N  +  1.  Once  the  model  parameters  are  found,  the  induced  current  at  higher 
frequencies  can  be  computed  using  (1).  The  radiated  field  is  then  easily  obtained  from  the  extrapolated 
current. 


III.  Results 

As  an  example,  we  consider  a  2-D  structure  as  shown  in  Fig.  2  (a).  We  are  interested  in  the 
radiation  pattern  over  a  frequency  band  from  0.15  to  0.45  GHz.  To  obtain  the  data  for  the 
extrapolation,  we  compute  the  induced  current  at  10  frequencies  from  0.15  to  0.24  GHz.  Then  we  use 
the  ESPRIT  algorithm  to  obtain  the  model  coefficients  for  each  current  element  and  compute  the 
radiated  field  based  on  the  model.  Fig.  2(a)  shows  the  radiated  field  of  a  horizontally  polarized  line 
source  as  a  function  of  frequency  at  an  elevation  angle  of  40°.  The  dashed  curve  is  obtained  from  the 
model-based  extrapolation  while  the  solid  curve  is  the  reference  brute-force  solution.  We  observe  that 
the  extrapolation  algorithm  correctly  predicts  the  peaks  and  null  positions  in  frequency,  indicating  a 
good  estimate  on  the  times-of-arrival.  Fig.  2(b)  is  the  time  domain  response  obtained  via  an  inverse 
Fourier  transform  of  the  frequency  data.  The  first  large  peak  corresponds  to  the  specular  scattering 
from  the  flat  surface  and  the  second  large  peak  is  due  to  the  scattering  from  the  step  region. 

We  notice  from  Fig.  2(a)  that  the  model-predicted  field  matches  well  with  the  reference  field  at 
the  first  10  frequencies,  but  drops  below  the  computed  field  as  frequency  goes  higher.  This  is  because 
the  intensity  of  the  field  radiated  by  the  line  source  is  proportional  to  the  square  root  of  the  frequency. 
We  can  overcome  this  by  compensating  this  effect  before  doing  the  extrapolation.  Thus  the  time-of- 
arrival  model  becomes 


where  should  be  replaced  by  co  for  3-D  situations.  After  the  compensation,  the  extrapolation  result 
is  further  improved,  as  shown  by  the  frequency  and  time  responses  in  Figs.  3(a)  and  3(b). 

Finally  we  look  at  the  radiation  problem  of  the  3-D  platform  in  Fig.  4(a).  The  source  is  a 
horizontally  polarized  dipole.  The  solver  used  is  FISC  [4],  which  is  a  3D  MoM  code  based  on  the  fast 
multipole  method.  Similar  to  the  2-D  case,  we  use  the  computed  current  at  10  frequencies  from  0.15  to 
0.24  GHz  to  extrapolate  the  data  to  0.45  GHz.  The  extrapolated  radiation  field  as  a  function  of 
frequency  at  the  elevation  angle  of  40°  is  shown  in  Fig.  4(b)  as  the  dashed  curve.  The  reference  brute- 
force  solution  is  plotted  as  the  solid  curve.  The  major  features  of  the  radiated  field  are  well 
characterized  by  the  extrapolation  algorithm. 
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IY.  Conclusion  and  Discussion 


As  we  have  seen  from  the  results,  the  frequency  extrapolation  technique  is  an  efficient  way  of 
obtaining  the  radiation  pattern  over  a  broad  band  of  frequencies.  Computation  time  is  reduced 
dramatically  since  the  current  is  rigorously  solved  only  at  low  frequencies.  We  improved  the  result  by 
compensating  the  frequency  factor  of  the  source  in  the  time-of-arrival  model.  This  indicates  that  a 
wrong  frequency  dependence  in  the  model  may  result  in  errors  in  the  model  coefficients.  In  addition  to 
the  frequency  factor  of  the  source,  the  frequency  dependence  of  different  scattering  components  of 
each  current  element  could  be  different,  due  to  different  scattering  physics.  Further  incorporation  of 
these  effects  should  further  enhance  the  accuracy  of  the  extrapolation. 
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Fig.  1.  Time-of-arrival  model 
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(a)  Geometry  of  the  3-D  platform 


Frequency  (GHz) 

(b)  Frequency  response 

Fig.  4.  Frequency  extrapolation  for  3-D  radiation.  -  FISC 

- FISC  at  10  point  +  extrapolation 


277 


DEVELOPING  ADAPTIVE  MODELS  AND  ESTIMATING  THE 
UNCERTAINTY  OF  PRESAMPLED  SPECTRAL  DATA 


Edmund  K.  Miller 

3225  Calle  Celestial,  Santa  Fe,  NM  87501-9613 
505-820-7371 ,  emiller@esa.lanl.gov 


0.  ABSTRACT 

Model-based  parameter  estimation  (MBPE)  provides  a  means  of  more  efficiently  describing 
observables  such  as  frequency,  or  spectral,  responses  that  are  obtained  from  generating 
models  (GM)  like  NEC  through  use  of  reduced-order  fitting  models  (FM)  derived  from  the 
problem  physics.  Besides  decreasing  the  amount  of  data  that  is  needed  to  develop  a  con¬ 
tinuous,  analytical  representation  of  a  response,  these  FMs  can  also  be  used  to  estimate 
the  quantitative  uncertainty  of  the  GM  samples  themselves.  These  ideas,  previously  il¬ 
lustrated  for  sampling  “on-the-fly”,  i.e.,  at  the  time  the  data  samples  are  being  computed,  are 
extended  here  for  application  to  presampled  data. 

1.  INTRODUCTION 

Use  of  MBPE  has  been  previously  described  [Miller  (1998)]  as  a  way  to  adaptively  sample 
frequency  spectra,  as  a  means  of  reducing  the  number  of  frequencies  at  which  a  GM  com¬ 
putation  is  needed.  This  is  made  possible  using  a  rational  function  (as  a  generalization  of  a 
pole  series),  reduced-order,  FM  to  interpolate  between  the  GM  samples.  Another  use  for 
such  FMs  is  that  of  estimating  the  uncertainty  of  data  that  is  sufficiently  oversampled,  either 
when  being  generated  or  only  available  in  presampled  form  [Aly  and  Wong  (1989)],  in  a 
fashion  analogous  to  using  linear  regression  for  data  described  by  a  straight  line  of  un¬ 
known  slope  and  y-axis  intercept.  These  two  ideas  are  further  developed,  specifically  for 
presampled  data,  in  this  discussion. 

2.  ADAPTIVE  SAMPLING  AND  UNCERTAINTY  ESTIMATION 

2.1  Adaptive  Sampling  Based  on  FM  Mismatch 

Since  the  basic  approach  has  been  described  elsewhere  [Miller  (1998)],  the  idea  of 
achieving  reduced-order  EM-data  descriptions  using  MBPE  is  only  briefly  outlined  here.  A 
mismatch  error,  or  error  measure 

AMEi  j(yk)  -  {JIMj(yk)  -  Mj(yk)l]/[IMi(yk)l  +  IMj(yk)l]>  (1 ) 

is  computed  for  each  pair  of  overlapping,  rational-function  FMs  Mj  and  Mj  at  sample  loca¬ 
tions  yk,  k  =  1 . Y  to  find  the  maximum  mismatch  error  (MME)  between  FMs  i  and  j,  MEjj[_ 

=  max{AM5j(yk)}  at  y  =  yL.  Location  of  the  next  GM  sample  is  then  chosen  to  maximize  the 
added  information  it  provides  by  adding  it  at  yL,  where  max{ME|jii  is  the  overall  maximum- 
mismatch  error  for  all  i,j,L  comparisons.  The  true  error  of  FMj  at  yL  is  also  then  becomes 
available  as  AGEj  L  before  FMj  has  been  updated  using  GML.  Sampling  of  the  GM  would  be 


concluded  when  the  MME  £  10"C,  where  C  is  the  convergence  desired.  In  using  this  ap¬ 
proach,  the  rank  of  an  individual  FM  can  also  be  maintained  below  some  maximum  value  to 
circumvent  possible  ill  conditioning  (the  effects  of  which  can  also  be  mitigated  by  singular- 
value  decomposition).  If  the  rank  of  a  FM  has  increased  to  a  specified  threshold  as  a  result 
of  adding  new  GM  samples  in  its  window,  it  is  divided  into  two,  new  overlapping  FMs  of  low¬ 
er  order.  This  approach  is  illustrated  conceptually  in  Fig.  1. 


Figure  1 .  The  possibility  of  developing  a  FM-representation  of  a  transfer  function  over  a  wide  frequency  interval 
by  employing  a  number  of  subinterval,  overlapping,  lower-order  FMs,  Mj,  is  illustrated  conceptually  here  (a). 
Each  additional  GM  sample  used  for  the  FM  computation  [the  x’s  in  (b)]  is  taken  where  the  maximum  FM-FM  mis¬ 
match  has  been  found,  continuing  until  the  overall  mismatch  falls  below  a  specified  estimation  uncertainty.  This 
results  in  a  generally  non-uniform  placement  of  the  GM  samples.  For  presampled  data  the  x’s  are  restricted  to 

the  available  sample  locations. 

2.2  Estimating  Data  Uncertainty 

MBPE  and  reduced-order  FMs  can  also  be  used  to  estimate  the  uncertainty  of  EM 
data,  by  which  we  mean  the  relative  inconsistency  of  data  obtained  from  a  process  that 
should  satisfy  Maxwell’s  equations.  The  procedure  outlined  here  is  intended  to  establish  a 
bound  on  data  uncertainty,  for  example  to  show  that  the  data  is  uncertain  to  no  more  than 

one  part  in  104.  As  for  the  case  of  adaptive  sampling,  overlapping  FMs  may  also  be  em¬ 
ployed,  with  the  major  differences  being  that  the  GM  data  must  be  adequately  oversampled 
and  in  how  that  data  is  used  to  quantify  the  FMs.  At  least  three  different  approaches  might 
be  employed  for  estimating  data  uncertainty: 

1)  By  varying  the  parameters  [order  of  the  numerator  polynomial  (n)  and  denomina¬ 
tor  polynomial  (d),  and  window  width  (W)]  of  a  single  FM,  a  best-fit  is  sought  between 
that  FM  and  the  unused  GM  data  in  that  window. 

2)  Successive  FMs  are  computed  using  fixed  n,  d  and  W  values  from  GM  data  that 
has  additive  random  noise  of  systematically  reducing  amplitude  until  no  further 
change  is  found  in  the  mismatch  error  between  the  FMs  and  unused  GM  data. 

3)  The  data  matrix,  either  square  or  over-determined,  is  solved  using  a  pseudo-i 
verse  solution  from  which  a  mismatch  error  between  all  of  the  GM  data  spanned  by 
that  FM  representation  is  obtained. 
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The  above  approaches  for  adaptive  sampling  and  uncertainty  estimation  of  presampled  GM 
data  are  illustrated  below. 

3.  EXAMPLES  OF  ADAPTIVELY  MODELING  PRESAMPLED  DATA 

Previous  examples  of  MBPE  adaptive  sampling  were  confined  to  perfect  electric  conductors 
(PEC)  [Miller  (1998)].  The  reasonable  question  might  arise  about  whether  the  approach 
works  comparably  well  for  dielectric  and  lossy  objects,  since  a  rationale’  for  the  procedure 
is  provided  by  SEM  (Singularity  Expansion  Method)  pole-based  descriptions.  The  poles  for 
a  given  PEC  object  and  one  of  identical  geometry  but  that  is  dielectric  or  lossy  can  possibly 
be  quite  different,  e.g.,  branch  cuts  can  be  introduced  [Aly  and  Wong  (1989)].  As  a  matter  of 
fact,  I  had  been  contacted  [Cloete  (1998)]  about  some  apparent  poor  results  resulting  from 
attempting  to  mode!  the  frequency  response  of  microstrip  antennas  on  a  lossy  substrate. 
Therefore,  it  appeared  timely  to  test  the  adaptive-sampling  technique  for  the  non-PEC  ap¬ 
plication. 

Not  having  a  program  to  generate  such  results,  I  posted  a  request  to  NEC-LIST  to  see 
whether  anyone  might  be  able  to  provide  me  the  needed  data.  [NEC-LIST  is  a  BBS  for  ex¬ 
changing  EM  information  with  a  computational  focus;  it  doesn’t  deal  just  with  NEC-related 
issues.  For  information  about,  or  to  join  NEC-LIST,  contact  Dave  Michelson, 
davem@ee.ubc.ca].  This  request  generated  several  responses,  and  soon  thereafter 
Michael  Kluskens  of  the  Naval  Research  Laboratory  [Kluskens  (1998)]  sent  me  some  data 
files  via  e-mail.  It’s  that  data  that  is  modeled  in  Figs.  1-2. 

The  major  difference  between  sampling  a  GM  on  the  fly  and  handling  pre-sampied  GM  data 
is,  of  course,  that  the  data  samples  used  to  determine  the  mismatch  between  overlapping 
fitting  models  (FMs)  cannot  be  arbitrarily  located,  but  are  limited  to  those  frequencies  at 
which  GM  samples  are  available.  Thus,  it’s  necessary  that  the  GM  data  be  sufficiently  sam¬ 
pled  that  the  FMs  converge  to  the  desired  degree  before  exhausting  the  available  data.  In 
this  case,  the  Kluskens-supplied  data  was  sampled  at  5-MHz  increments  from  5  to  495 
MHz,  for  the  backscattered  field  from  an  infinite,  circular  cylinder  of  radius  a  =  0.159  m,  of 
relative  permittivity  16  (Fig.  2)  and  16  -  ]16  (Fig.  3),  respectively.  Five  initial  FMs  and  eight 
GM  samples  were  used  in  each  case,  spanning  samples  1-4,  1-5,  3-6,  4-8,  5-8.  Additional 
GM  samples,  shown  by  the  solid  circles,  were  used  from  the  pre-sampled  set  as  needed 
until  a  maximum,  normalized  FM-FM  mismatch  of  0.01  was  achieved.  If  adding  a  new  GM 
samples  to  a  given  FM  resulted  inn+d  +  1>18,  it  was  divided  into  two  new  overlapping 
models,  resulting  in  a  total  of  6  FMs  for  Fig.  2  where  one  new  FM  was  divided  and  5  for  Fig 

3,  where  this  wasn’t  necessary.  Generally  speaking  the  added  GM  samples  in  both  cases 
are  located  where  the  response  is  changing  most  rapidly,  with  the  dielectric  case  requiring 
more  points  overall  because  of  its  greater  complexity.  The  GM  and  FM  results  are  generally 
graphically  indistinguishable  except  in  regions  of  rapid  change  where  the  average  FM  is 
sampled  more  finely  than  the  precomputed  GM  data. 

4.  EXAMPLES  OF  ESTIMATING  DATA  UNCERTAINTY 

Using  a  FM  to  estimate  data  uncertainty  requires  that  “sufficient”  GM  data  is  available.  For 
example,  in  using  either  approach  (1)  or  (2)  in  Section  2.2,  some  of  the  GM  data  is  used  to 
quantify  a  FM  whose  match  to  the  remaining  GM  data  then  provides  the  uncertainty  test. 
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Approach  (3)  instead  employs  all  of  the  available  data,  both  to  obtain  a  FM  and  to  estimate 
the  data  uncertainty.  Since  the  accuracy  of  the  FM  representation  depends  on  both  its  pa¬ 
rameters  and  the  data  being  tested,  it’s  advisable  to  vary  the  former  and  to  observe  the 
trends  in  the  match  between  the  FM  predictions  and  GM  data  (called  here  the  data  fit). 
Based  on  computer  experiments  using  various  data,  the  best  combination  of  FM  parameters 
and  GM  data  seems  to  be  n  =  7,  d  =  6  and  a  W  that  spans  about  two  resonance  peaks 
[Miller  (1998)].  Some  examples  of  various  uncertainty  tests  follow. 

4.1  Uncertainty  Testing  Using  a  Variable-Pamameter  Fitting  Model 

Two  examples  of  testing  the  uncertainty  of  presampled  data  using  approach  (1)  of 
Section  2.2  are  presented  in  Fig.  4a,  the  Kluskens  data,  and  Fig.  4b,  data  provided  by  Clo- 
ete  for  the  microstrip  antenna.  In  both  cases,  a  single  FM  is  used  having  parameters  n  =  2,. 

.  .,8  and  d  =  1 ,  .,7  with  W  s  42  so  that  at  most  only  every  other  GM  sample  was  used  for  the 
FM  computation.  It  can  be  observed  that  in  Fig.  4a  the  maximum  average  FM-GM  match  is 
about  6.5  digits  whereas  that  in  Fig.  4b  is  about  3.8  digits,  indicating  uncertainties  of  no 
worse  than  3x1 0"?  and  1.6x1  O'4  respectively. 

4.2  Uncertainty  Testing  Using  Additive  Noise 

A  further  test  of  the  Kluskens  data  is  illustrated  in  Fig.  5  where  four  FMs  are  employed 
having  n  =  7  and  d  =  6  for  models  1  and  4  and  n  =  7  and  d  =  7  for  models  2  and  3,  to  ensure 
overlap  of  at  least  two  FMs  everywhere,  and  where  alternate  GM  samples  are  reserved  for 
testing  the  data  fit.  The  average  FM  data  fit  is  shown  as  a  function  of  frequency  with  the  pa¬ 
rameter  L  denoting  the  level  of  the  random  noise  of  maximum  value  10'L  added  to  the  GM 
samples.  For  values  of  L  up  to  6,  the  individual  data-fit  values  vary  about  an  average  of  ap¬ 
proximately  L,  but  for  L  =  7  and  8  saturate  at  about  6.5,  consistent  with  the  results  seen  in 
Fig.  4. 


4.3  Uncertainty  Testing  Using  an  Overdetermined  System 

The  result  in  Fig.  6,  using  both  the  Kluskens  and  Cioete  data,  is  for  the  case  where 
27  successive  GM  samples  are  used  for  the  same  four  FMs  as  used  for  Fig.  5,  but  with  each 
FM  data  matrix  now  overdetermined  by  10  GM  samples.  It  can  be  seen  that  the  FM-GM 
match  oscillates  around  an  average  of  about  6.8,  again  consistent  with  previous  results. 
The  indication  is  that  the  Kluskens’  data  has  a  normalized  uncertainty  no  worse  than  2  to 
3x1  O'7.  The  corresponding  value  for  the  Cioete  data  from  this  test  indicates  an  uncertainty 
of  somewhat  less  than  10'4,  somewhat  higher  than  implied  by  Fig.  4b. 

4.4  Examples  of  Estimating  the  Uncertainty  of  Experimental-Data 

Examples  of  using  MBPE  to  estimate  the  uncertainty  of  experimental  data  are  pre¬ 
sented  in  Figs.  7-9  using  results  measured  at  the  David  Florida  Laboratory  [Mishra  (1996)] 
for  the  field  backscattered  from  a  PEC  cube  .  The  actual  data  is  comprised  of  801  samples 
from  2  to  18  GHz.  Initially,  complex  data  was  employed,  yielding  the  results  shown  in  the 
Fig.  7,  where  rather  poor  agreement  is  obtained  between  the  FM  (n  =  7,  d  =  6)  and  the 
measured  data. 

Anticipating  that  the  data  might  be  known  more  accurately  in  magnitude  than  in  phase 
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(which  was  used  to  obtain  the  rea!  and  imaginary  components  were  ),  the  FM  was  recom¬ 
puted  using  only  the  magnitude  data  instead,  with  the  outcome  shown  in  Fig.  8.  For  sim¬ 
plicity,  the  computation  continues  to  use  a  complex  FM  but  since  the  input  data  is  now  pure 
real,  the  imaginary  part  of  the  FM  is  automatically  zero.  Graphically,  it’s  clear  that  the  match 
obtained  between  the  FM  and  magnitude  data  is  much  better  than  when  using  the  complex 
data. 


Figure  2.  Adaptive  sampling  using  precomputed  data  (from  Kluskens)  for  the  scattered  field  from  an  infinite, 
circular  cylinder  whose  relative  permittivity  is  16  is  illustrated  here.  The  precomputed  GM  samples,  spaced 
at  5-MHz  intervals,  are  joined  by  a  solid  line  while  the  average  FM  results  are  shown  by  the  dotted  line.  The 
GM  samples  used  for  the  original  FM  computation  are  shown  by  open  circles  while  the  samples  added  dur¬ 
ing  the  adaptation  process  are  indicated  by  solid  circles. 


Figure  3.  Adaptive  sampling  using  precomputed  data  (from  Kluskens)  for  the  scattered  field  from  an  infinite, 
circular  cyfinder  whose  relative  permittivity  is  16  -  j16  is  illustrated  here.  The  precomputed  GM  samples, 
spaced  at  5-MHz  intervals,  are  joined  by  a  solid  line  while  the  average  FM  results  are  shown  by  the  dotted 
line.  The  GM  samples  used  for  the  original  FM  computation  are  shown  by  open  circles  while  the  samples 
added  during  the  adaptation  process  are  indicated  by  solid  circles. 

This  is  confirmed  by  the  results  of  Fig.  9a,  where  the  mismatch  between  the  original  data 
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and  these  two  FMs  is  shown,  indicating  that  the  uncertainty  of  the  measured  magnitude 
data  appears  to  be  about  1  per  cent  on  average.  It  appears  reasonable  to  also  conclude 
from  these  results  that  the  error  in  the  measured  phase  is  substantially  greater  than  that  of 
the  magnitude. 


Figure  4.  Average  data  fit  between  a  FM  of  varying  n  and  d  and  GM  samples  for  the  Kluskens  data  (a)  and  the 
Cloete  data  (b).  These  results  indicate  that  for  (a)  the  GM  uncertainty  is  no  worse  than  about  6.5  digits  (3x1 0"7) 

and  for  (b)  about  2x1 0*4. 


Rgure  5.  Data  fit  between  average  FM  and  GM  sam¬ 
ples  as  a  function  of  normalized  frequency  for  the 
Kluskens  data  with  additive  random  noise  of  maximum 
value  10'k  The  data  fit  is  seen  to  saturate  at  about  6.5 
digits  for  L  >  6. 
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Figure  6.  Data  fit  between  the  average  FM  and  GM 
samples  as  a  function  of  normalized  frequency  for  the 
Kluskens  data  (solid  symbols)  and  Cloete  data  (open 
symbols).  Each  computation  employed  four  FMs  and 
10  extra  GM  samples  with  the  over-determined  data 
matrix  solved  using  a  pseudo  inverse. 
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Figure  7.  Resutts  obtained  from  a  1 4-coefficient  complex  FM  with  n  =  7  and  d  =  6  (the  dark  solid  line)  using 
data  samples  from  a  PEC  cube  (shown  by  the  solid  circles),  with  the  narrow  line  showing  the  original  data 
(from  Mishra).  Although  the  FM  seems  to  average  the  oscillations  seen  in  the  experimental  data,  the  overall 

agreement  is  poor. 


Finally,  the  results  of  Fig.  9b  demonstrate  the  relative  insensitivity  to  the  width  of  the  FM  fre¬ 
quency  window,  using  in  all  cases  a  magnitude-only  FM  having  n  =  7  and  d  =  6.  Despite 
varying  the  window  width  from  100  to  300,  the  uncertainty  estimates  in  the  frequency  inter¬ 
vals  where  the  different  windows  overlap  are  quite  similar  on  average.  Data  points  in  both 
Figs.  8  and  9  that  are  near  zero  indicate  that  there  is  essentially  no  agreement  between  the 
FMs  and  the  measured  data. 


Figure  8.  Repeat  of  Fig.  7  but  using  the  magnitude  of  the  measured  data  as  the  input.  The  data  fit  is  much 
better  here  than  when  using  complex  input  data. 
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5.  CONCLUDING  COMMENTS 

The  use  of  model-based  parameter  estimation  (MBPE)  to  develop  adaptive,  windowed,  ra¬ 
tional-function  fitting  models  for  presampled  electromagnetic  data,  and  for  estimating  the 
uncertainty  of  such  data  has  been  outlined  and  illustrated  here.  For  the  former  application, 
MBPE  is  advantageous  in  permitting  a  reduced-order,  analytically  continuous  approxima¬ 
tion  to  the  discretely  sampled  data  to  be  realized.  For  the  latter,  the  MBPE  procedure  per¬ 
mits  the  uncertainty  of  the  presampled  data  to  be  estimated  in  a  manner  analogous  to  linear 
regression  when  data  is  described  by  a  straight  line. 
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(a)  (b) 

Figure  9.  The  match  in  digits  obtained  using  the  complex  FM  (solid  symbols)  and  magnitude-only  FM  (open 
symbols)  (a)  and  match  in  digits  obtained  using  a  magnitude-only  FM  of  various  window  widths  (b)  for  the  experi¬ 
mental  field  scattered  from  a  PEC  cube  (from  Mishra),  in  each  case  using  n  =  7  and  d=  6.  Average  agreement  for 
the  magnitude-only  FM  in  (a)  is  about  2  digits,  with  minima  occurring  in  the  vicinity  of  minima  in  the  RCS.  Window 
widths  were  used  in  (b)of  500-800  (solid  circles),  550-800  (open  circles),  600-800  (open  squares),  650-800 
(open  triangles),  and  700-800  (solid  squares). 
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ABSTRACT 

A  hybrid  method  for  the  simultaneous  interpolation  of  antenna  radiation  patterns  in  the  spatial  and  frequency  domains  is 
presented  which  combines  the  Pade  rational  function,  a  common  MBPE  fitting  model,  with  a  polynomial  function  to 
account  for  spatial  variation.  One  of  the  most  attractive  features  of  this  method  is  that  a  single  set  of  previously  determined 
interpolation  coefficients  can  be  used  to  reproduce  an  entire  radiation  pattern  for  a  particular  antenna  at  any  frequency 
within  its  operational  range,  and  with  any  desired  angular  resolution.  Two  methods  for  performing  this  hybrid 
interpolation  will  be  presented  and  discussed  in  this  paper.  The  first  uses  a  generalized  form  of  the  Pade  rational  function 
and  a  multiple-domain  sampling  scheme  to  interpolate  over  the  entire  range  of  interest  direcdy,  and  the  second  approach 
involves  using  a  low-order  (3-point)  piece-wise  interpolation  scheme  that  only  requires  sampling  in  the  spatial  domain. 
The  two  methods  are  applied  to  a  0.5  meter  dipole  modeled  from  150  to  950  MHz  and  from  0°  to  90°  in  0.  This  example  is 
used  to  illustrate  advantages  and  disadvantages  in  the  use  of  both  of  these  methods. 

L  INTRODUCTION  t  ,  , 

The  generation  and  storage  of  high-fidelity  data  for  wideband  antenna  radiation  patterns  can  place  a  significant  burden  on 
present-day  computational  platforms.  For  this  reason,  a  technique  which  can  quickly  and  efficiently  interpolate,  store,  and 
reconstruct  antenna  radiation  patterns,  in  both  the  spatial  and  frequency  domains  and  with  any  desired  fidelity,  is  highly 
desirable  An  interpolation  scheme  based  on  Model-Based  Parameter  Estimation  (MBPE)  is  well-suited  for  this  problem 
because  fitting  equations  that  are  model-based  tend  to  greatly  reduce  the  required  order  of  the  interpolation,  and  thus  the 
amount  of  data  that  needs  to  be  stored.  A  series  of  articles  by  Miller  [1-3]  describes  in  detail  the  principles  behind  MBPE, 
including  a  discussion  of  several  model  equations  and  their  applications  to  a  wide  variety  of  EM  problems.  The  model- 
based  equation  (fitting  model)  considered  in  this  paper  is  the  Pade  rational  function,  which  has  been  used  successfully  to 
interpolate  antenna  input  impedance  as  a  function  of  frequency  [1-3],  and  also  for  the  electric  field  frequency  response  of 
antennas  [4-5]  The  conventional  form  of  this  function  containes  no  information  about  the  position  in  space  at  which  the 
data  is  being  interpolated,  so  it  cannot  be  used  to  predict  how  changes  in  spatial  position  affect  the  structure  of  the 
spectrum  A  new  hybrid  MBPE  technique  was  presented  in  [6]  which  included  spatial  dependence  in  the  numerator  and 
denominator  coefficients  of  the  Pad 6  rational  function.  This  enhancement  allows  the  simultaneous  interpolation  of  antenna 
radiation  patterns  in  both  the  spatial  and  frequency  domains  such  that  only  a  single  set  of  coefficients  is  required  to 
reconstruct  a  radiation  pattern  at  any  desired  frequency  within  the  fitting  model  range.  A  procedure  for  using  MBPE 
interpolated  antenna  input  impedance  as  a  function  of  frequency  to  transform  the  interpolated  electric  fields  into 
corresponding  gain  patterns  at  any  frequency  was  also  introduced  in  [4-6], 

The  technique  of  simultaneous  interpolation  described  in  [6]  was  found  to  produce  excellent  results,  with  significant 
pattern  compression  ratios  and  only  small  errors  in  the  interpolated  gain  patterns.  Its  only  drawbacks  involve  the 
complexity  of  the  matrix  equations  needed  to  perform  the  interpolations,  and  the  amount  of  calculation  time  required  to 
construct  these  matrices.  This  paper  presents  and  discusses  a  modification  to  the  procedure  outlined  in  [6],  which 
incorporates  piece-wise  spectral  domain  interpolation,  with  spatial  dependence  included,  to  model  antenna  radiation 
patterns  over  large  bandwidths.  This  method  only  requires  direct  interpolation  in  the  spatial  domain,  thereby  using 
relatively  small  matrices  and  requiring  much  less  computation  time.  It  also  allows  the  models  used  to  incorporate  spatial 
dependence  to  be  interchanged  quickly  and  efficiently  without  requiring  that  the  entire  problem  be  recalculated.  Finally, 
this  piece-wise  procedure  could  be  used  in  conjunction  with  the  MBPE  Windowed  Adaptive  Sampling  procedure  described 
by  Miller  [7-8]  to  determine  optimum  frequency  sampling  points  for  interpolating  complex  electric  field  spectra. 
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a  THEORY 

The  standard  form  of  the  Pade  rational  function  is  given  by  [1-3] 

N0+Nys+N2s2  +  -+Nnsn 

r  (j)  **  fy  \ 

Do+DyS+D2s  +,w+D<j_jS  +s 

(1) 


which  has  a  total  of  n  +  d  +  1  unknown  numerator  and  denominator  coefficients.  For  the  interpolation  of  antenna  radiation 
pattern  frequency  spectra,  F(s)  is  defined  as  the  set  of  complex  frequency-dependent  MoM  electric  field  data,  and  s  is  the 
complex-valued  frequency,  j(n.  However,  this  form  of  (1)  is  limited  to  interpolating  frequency  spectra  at  single  points  in 
space  [4-5],  and  does  not  contain  any  information  on  the  spatial  variation  of  the  spectrum.  A  more  general  form  of  the 
Pade  rational  function  has  been  introduced  in  [6]  which  is  given  by 


F(s,B)  = 


N0(e)+N1(e)s+N2(ey  +•••+#„  (e>" 
D0(e)+Dl(e)s+D2(ey +...+Dd^(e  y~l  +x' 


(2) 


where  now  the  numerator  and  denominator  coefficients  are  not  only  dependent  on  the  complex  frequency  s  but  also  on  the 
spatial  variable  0.  This  form  of  the  Pad6  rational  function  is  capable  of  interpolating  an  antenna  radiation  pattern 
simultaneously  in  both  the  spatial  and  frequency  domains.  The  spatial  dependence  was  modeled  in  [6]  by  assuming  that 
each  numerator  and  denominator  coefficient  could  be  represented  by  a  polynomial  expansion  in  0  of  the  form 


N0{ey  n$  +n10b+n^b2+-+ n£b  * 
Ny  (e ) = jv,° + n\b + Nfs 2  +  ■•  • + w,*  e k 

Nn(e)=N^+N1nB  +  N262  +-+Nk8k 
D0  (&)=  Do  +  D\6 + D%8 2  + -  +  Dfc  k 
D]  (e)=  D,°  +D\e  +  Dle2  +---  +  Z>,*6>* 


*V.i(e)=02-i  +D1d_lB  +  D2_1B2  +-+Dk_l8k 


(3) 


where  k  is  the  order  of  the  polynomial.  Under  these  conditions,  (2)  has  (n  +  d  +  1)  x  (k  +  1)  unknown  complex 
coefficients.  The  procedure  for  determining  these  coefficients  is  to  sample  the  MoM  data  set  at  Nfx  NQ  points,  write  this 
sampled  data  in  the  form  of  a  matrix  equation  using  the  Pad6  rational  function,  and  invert  the  matrix  equation  to  solve  for 
the  unknown  parameters  [6].  Here,  NQ  is  the  number  of  spatial  sampling  points  taken  at  each  of  the  Nf  frequencies,  and  the 
dimensions  of  the  resulting  matrix  that  requires  inversion  is  [(n  +  d+  1)  x  (fc+1)]  by  (Nfx  N6).  It  should  be  noted  that  this 
procedure  can  be  extended  to  included  spatial  dependence  in  two  dimensions,  for  both  0  and  4>,  by  using  a  binomial  spatial 
expansion  for  each  numerator  and  denominator  coefficient  of  the  Pade  rational  function  [6].  For  the  purpose  of  this  paper, 
however,  only  one-dimensional  spatial  dependence  will  be  considered. 
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A  new  technique  for  performing  the  same  simultaneous  interpolation  described  above  is  presented  here  which  has  several 
advantages  over  the  generalized  procedure.  It  involves  using  interpolation  "windows",  consisting  of  only  three  fitting 
frequencies,  and  incorporating  a  piece-wise  scheme  for  interpolating  over  large  frequency  ranges.  The  Pade  rational 
function  of  orders  n  -  1  and  d=  1  with  spatial  dependence  included  is 


F(s,0)= 


N0(e)+N]{e)s 

D0(e)+s 


(4) 


By  evaluating  (4)  at  three  different  sampling  frequencies,  we  arrive  at  the  following  three  equations  which  are  dependent 
upon  the  spatial  variable  8: 


fi(o)= 


iyQ(g)+*i(g>i 

Z)0(8)+  $1 


/2(*)= 


N0(9)+Nl(9)s2 

D0(d)+s2 


Mob 


N0(e)+Nl(9)si 

D0{0)+s  3 


(5) 


where  it  is  assumed  that  ftf)  exactly  represents  the  spatial  radiation  pattern  at  the  frequency  s,-  for  /  -  1,  2  and  3.  The 
expressions  in  (5)  may  be  rearranged  and  written  in  matrix  form  as 

'l  -/,(e)1pvo(0)l  \MoW 
1  *2  -Mo)  Mo]  =  Mob 

1  *3  -/3(0)JK(0)j  U(0>3j 

(6) 

Hence,  since  this  only  represents  a  3-by-3  system,  the  vector  of  unknown  coefficients  may  be  solved  for  directly  via 
Cramer's  Rule  [9],  which  yields  the  following  expressions  for  the  three  Padd  coefficients: 


xr  /.x  f, (9)Mo\s^ -s2si]+Mo)Moisis2  (o)Mois2^ -*1*2} 

N°'e'-  fA9h7 -sA+fMs,  -sA+Mob  -s2l 


MpWpU -sMMobioh -s2hMo)Mofo  -M 

'  f\{d\s2  -S3J+/2(e)ty3  -5iJ+/3(0Xj1  -*l\ 


,  .  /1  (9X^1  *3  ~  *1  *2  ]+  ft  (0  frl  -  ^2^3  ]+  h  ( 0  2  J3  -  s\  *3  ] 

D°{e)= - Aiois.-sMMoU-s.bMoTs^] 


(7) 


To  determine  the  coefficients  given  in  (7),  expressions  for  the  three  spatial  radiation  patterns  at  the  frequencies  Jj,  s2,  and  s3 
must  be  found  using  an  interpolation  scheme  in  the  spatial  domain.  A  similar  polynomial  expression  to  the  one  given  in 
(3)  also  dependent  on  8  and  with  order  k,  has  been  shown  previously  to  work  well  for  interpolating  one-dimensional 
spatial  radiation  patterns.  Using  this  formulation,  and  incorporating  a  piece-wise  scheme  to  interpolate  large  frequency 
ranges,  three  matrices  of  size  (*  +  1)  by  (*  +  1)  must  be  filled  and  inverted  for  each  piece  of  the  interpolation.  A  significant 
advantage  occurs  under  these  conditions  since  the  resulting  matrices,  which  are  small  compared  to  the  one  needed  for  the 
generalized  procedure  as  described  in  [6],  can  collectively  be  filled  and  inverted  in  a  relatively  short  period  of  time.  A 
second  advantage  is  that  the  expressions  given  in  (7)  allow  the  spatial  dependence  of  the  radiation  pattern  to  be  interpolated 
independently  from  the  spectral  variation,  such  that  any  function  sufficient  to  accurately  represent  the  spatial  radiation 
pattern  may  be  used  for /i(8),/2(8),  and/3(8).  Using  the  generalized  procedure,  the  entire  sampling  matrix  would  need  to 
be  recalculated  and  inverted  in  order  to  accomplish  this  same  task.  A  third  advantage  of  this  new  technique  is  that  toe 
reduced  size  and  convenience  of  using  much  smaller  interpolation  ranges  make  it  a  suitable  candidate  for  generalizing  toe 
MBPE  Windowed  Adaptive  Sampling  procedure,  described  in  [7-8],  for  use  in  simultaneous  interpolation.  This  would 
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allow  optimal  frequency  sampling  points  to  be  chosen  to  increase  the  effectiveness  of  the  simultaneous  interpolations,  and 
to  eliminate  trial-and-error  as  a  means  of  choosing  fitting  points. 

m.  RESULTS 

The  generalized  and  piece-wise  interpolation  procedures  were  applied  to  the  radiation  pattern  of  a  0.5  meter  dipole, 
modeled  from  150  to  950  MHz,  and  for  0  from  0°  to  90°.  For  the  generalized  hybrid  interpolation,  seven  fitting 
frequencies,  150,  300,  350,  600,  800,  900,  and  950  MHz,  and  eight  fitting  angles,  2,  10,  20, 40,  60,  80,  and  90°,  were  used 
to  interpolate  a  Padd  rational  function  of  orders  n  =  4  and  d  =  2,  and  with  polynomial  spatial  dependence  order  k  =  7  [6]. 
This  formulation  resulted  in  a  56  by  56  matrix  which  required  inversion  to  solve  for  the  unknown  parameters.  The  piece- 
wise  hybrid  MBPE  technique  was  performed  using  4  fitting  windows,  from  150  to  350  MHz  (with  fitting  frequencies  200, 
300,  and  350  MHz),  350  to  550  MHz  (with  fitting  frequencies  350,  425,  and  550  MHz),  550  to  700  MHz  (with  fitting 
frequencies  550,  600,  and  700  MHz),  and  750  to  950  MHz  (with  fitting  frequencies  750,  875,  and  940  MHz).  The  spatial 
radiation  patterns  at  the  nine  unique  sampling  frequencies  were  interpolated  using  k  =  l  polynomials,  and  using  the  same 
fitting  angles  as  in  the  generalized  technique.  This  information  was  then  used  to  determine  explicit  representations  for  the 
Padd  coefficients  given  in  (7).  The  difference  in  computation  time  required  to  perform  these  two  operations  was 
significant,  with  the  piece-wise  technique  being  much  faster  than  the  generalized  hybrid  interpolation.  Figure  1  shows  two 
sets  of  frequency  spectra  of  electric  field  magnitude  versus  frequency  for  0  =  30,  70,  and  90°.  The  left  column  shows 
results  from  the  application  of  the  piece-wise  hybrid  MBPE  interpolation,  and  the  right  column  the  generalized  technique, 
both  are  compared  to  the  exact  spectra  generated  via  MoM.  The  two  different  interpolation  procedures  both  show  excellent 
agreement  with  the  MoM  data. 

A  more  complete  picture  of  the  accuracy  acheieved  using  these  various  interpolation  procedures  is  found  by  examining  the 
gain  patterns  which  result  from  the  estimated  electric  fields.  To  determine  these  patterns,  a  procedure  which  uses  MBPE- 
estimated  antenna  input  impedance  as  a  function  of  frequency  to  transform  from  electric  fields  to  gain  patterns  was 
introduced  in  [6].  This  transformation  was  applied  to  three  interpolations  of  the  dipole  case  described  above,  the  first  two 
being  the  generalized  and  4-window  3-point  piece-wise  interpolations,  and  the  third  being  a  3-window  3-point  piece-wise 
interpolation  using  just  7  fitting  frequencies.  The  3  fitting  windows  used  in  this  interpolation  were  from  150  to  450  MHz 
(with  fitting  frequencies  200,  300,  and  450  MHz),  450  to  650  MHz  (with  fitting  frequencies  450,  550,  and  650  MHz),  and 
650  to  950  MHz  (with  fitting  frequencies  650, 875,  and  940  MHz).  Figure  2  shows  gain  pattern  slices  at  300  MHz  and  720 
MHz,  with  the  left  column  presenting  a  comparison  between  the  MoM,  generalized,  and  3-window  gain  patterns,  and  the 
right  column  showing  the  results  from  the  MoM,  generalized,  and  4-window  piece-wise  interpolations.  Figure  3  shows 
similar  plots  at  800  MHz  and  933  MHz.  The  results  in  Figures  2  and  3  show  that  all  three  interpolated  gain  patterns  agree 
very  well  with  the  MoM  gain  patterns  at  these  four  frequencies,  with  differences  of  a  few  dB  apparent  in  both  the  3-  and  4- 
window  piece-wise  interpolations  in  the  720  and  800  MHz  plots.  An  examination  of  the  overall  error  in  gain  throughout 
the  entire  range  of  operation  of  the  dipole  antenna,  from  150  to  950  MHz  and  over  all  points  in  space,  shows  that  the 
generalized  procedure  does  in  fact  produce  the  best  results.  The  3-window  interpolation  has  a  maximum  gain  error  of  2.9 
dB  with  an  average  error  of  about  1.0  dB,  the  4-window  interpolation  has  a  maximum  error  of  1.79  dB  with  an  average 
error  at  0.7  dB,  and  the  generalized  procedure  shows  a  maximum  error  of  only  0.5  dB  with  an  average  gain  error  of  less 
than  0.1  dB. 

It  is  clear  from  these  results  that  in  order  to  obtain  accuracies  comparable  to  those  achieved  by  solving  the  generalized  Pade 
rational  function  [2],  the  piece-wise  procedure  requires  a  greater  number  of  frequency  sampling  points.  When  interpolating 
the  entire  range  from  150  to  950  MHz  using  a  single  model,  information  on  the  global  structure  of  the  spectrum,  such  as 
the  locations  of  poles  and  zeros,  improves  the  agreement  between  the  actual  and  interpolated  curves  at  all  points  in  the 
model  range.  The  smaller,  isolated  windows  used  in  the  piece-wise  procedure  possess  information  only  on  the  local 
structure  of  the  spectrum,  and  so  more  fitting  frequencies  are  required  to  accurately  model  the  data.  One  method  for 
improving  the  ability  of  the  piece-wise  technique  to  model  complex  structures  would  be  to  employ  4-point  windows,  rather 
than  the  3-point  windows  considered  here,  thus  allowing  a  single  window  to  capture  more  of  the  overall  structure  of  the 
spectrum.  The  extension  of  this  technique  to  4-point  windows  is  currently  under  investigation,  as  is  research  into  other 
possible  spatial  domain  fitting  models,  and  the  use  of  the  adaptive  windowed  sampling  procedure  [7-8]. 
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By  including  spatial  dependence  in  the  numerator  and  denominator  coefficients  of  the  Pade  rational  function,  antenna 
radiation  patterns  can  be  interpolated  simultaneously  in  the  spatial  and  frequency  domains.  This  hybrid  MBPE  technique  is 
performed  by  directly  solving  the  generalized  Padd  rational  function  (2),  or  by  using  piece-wise  interpolation  with  the 
spectral  sampling  constrained  to  only  three  frequencies  per  interpolation  window.  The  piece-wise  procedure  allows  for 
easy  substitution  of  spatial-dependent  fitting  models  without  requiring  that  a  large  matrix  equation  be  reevaluated.  Another 
advantage  of  the  method  is  that  the  small  degree  of  the  interpolations  involved  in  the  piece-wise  procedure  is  amenable  to 
the  use  of  a  Windowed  Adaptive  Sampling  technique  for  the  efficient  determination  of  optimal  sampling  points.  A 
comparison  of  the  two  techniques  for  performing  simultaneous  interpolation  shows  that  both  produce  reconstructed  models 
that  are  in  excellent  agreement  with  the  original  MoM  data.  While  the  piece-wise  technique  does  require  on  average  more 
sampling  frequencies  than  the  generalized  interpolation  approach,  the  additional  flexibility  and  efficiency  of  the  piece-wise 
interpolation  procedure  provides  an  attractive  alternative. 
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Figure  1 :  Plots  of  MBPE  versus  MoM  electric  field  frequency  spectra  with  3-point  fitting  windows 
(left)  and  dual-domain  sampling  interpolation  (right)  methods.  The  fitting  frequency  windows  are 
150-350  MHz,  350-550  MHz,  550-700  MHz,  and  700-950  MHz,  with  the  fitting  frequencies  for  each 
case  defined  by  the  symbols  given.  The  MoM  curves  are  indicated  by  solid  lines  and  the  MBPE 
determined  curves  are  given  by  the  dashed  lines. 
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Figure  2:  Comparison  of  MoM,  generated  MBPE,  and  windowed  MBPE  gain  patterns  at  (a)  300  MHz  and 
(b)  720  MHz.  The  left  column  shows  results  from  using  3  windows  (7  fitting  frequencies)  for  the  piece-wise 
interpolation  procedure,  while  the  right  column  shows  results  from  the  4  window  (9  fitting  frequencies) 
piece-wise  MBPE  interpolation  (as  shown  in  the  frequency  spectra  of  Figure  1). 
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Figure  3:  Comparison  of  MoM,  generated  MBPE,  and  windowed  MBPE  gain  patterns  at  (a)  800  MHz  and 
(b)  933  MHz.  The  left  column  shows  results  from  using  3  windows  (7  fitting  frequencies)  for  the  piece-wise 
interpolation  procedure,  while  the  right  column  shows  results  from  the  4  window  (9  fitting  frequencies) 
piece-wise  MBPE  interpolation  (as  shown  in  the  frequency  spectra  of  Figure  1). 


293 


Full  Wave  Analysis  of  Generalized  Microstrip  Lines  using 
Model  Order  Reduction  Techniques  f 

Kaladhar  Radhakrishnan  and  Weng  Cho  Chew 
Center  for  Computational  Electromagnetics 
Department  of  Electrical  and  Computer  Engineering 
University  of  Illinois,  Urbana,  IL  61801 
Email:  kradhakr@uiuc.edu 

Abstract 

In  this  paper  we  have  developed  a  numerical  technique  to  carry  out  the  full  wave  analysis  of  a  generalized  microstrip 
line  on  an  inhomogeneous  substrate.  We  use  the  finite  difference  method  to  express  the  problem  in  the  form  of  a  sparse 
matrix  equation.  A  model  order  reduction  technique  is  used  to  express  the  problem  in  terms  of  a  smaller  matrix.  This  is 
accomplished  using  the  bi-Lanczos  algorithm.  The  overall  complexity  of  the  algorithm  is  0(N1J>).  Storage  requirements 
can  be  made  to  scale  as  O(JV)  making  it  possible  to  analyze  large  problems  on  small  computers.  Very  good  agreement  is 
seen  between  published  results  and  the  results  obtained  using  this  technique. 

1.  Introduction 

Recent  developments  in  microwave  integrated  circuits  have  led  to  complex  waveguiding  structures  with 
multiple  conductors  on  an  inhomogeneous  substrate.  Several  techniques  are  available  for  the  analysis  of  microstrip 
lines  [1].  More  recently,  these  techniques  have  been  extended  to  analyze  complicated  microstrip  lines  with  multiple 
conductors  on  a  multilayer  substrate  [*2]-[7].  However,  most  of  the  work  is  limited  to  layered  substrates  whose 
permittivity  is  a  piecewise  constant  function.  In  [7],  a  numerical  technique  was  developed  to  analyze  structures 
with  continuous  permittivity  profiles  along  the  lateral  coordinate.  This  situation  occurs  when  a  substrate  contains 
doping  zones  where  the  permittivity  is  different  from  that  of  the  background.  In  [3],  the  vector  finite  element 
method  is  used  to  analyze  the  inhomogeneous  multiconductor  transmission  line.  However,  their  analysis  is  limited 
to  quasi-TEM  modes.  To  analyze  the  most  general  type  of  microstrip  lines  with  an  inhomogeneous  substrate, 
one  needs  to  use  a  numerical  technique  like  the  finite  difference  or  the  finite  element  method.  This  results  in  a 
sparse  matrix  equation  with  a  large  number  of  unknowns.  By  using  a  model  order  reduction  technique  like  the 
bi-Lanczos  method  it  is  possible  to  solve  such  a  sparse  matrix  equation  efficiently. 

In  this  paper,  we  have  extended  a  numerical  technique  to  analyze  dielectric  waveguides  [8]  to  the  analysis  of 
microstrip  lines  on  inhomogeneous  substrates.  The  inhomogeneous  vector  wave  equation  is  used  as  the  governing 
equation.  This  allows  us  to  model  continuously  varying  permittivity  profiles  along  both  transverse  directions. 
The  transverse  component  of  the  wave  equation  is  then  discretized  using  the  finite  difference  technique  resulting 
in  a  sparse  matrix  equation.  The  eigenmodes  of  the  sparse  matrix  equation  correspond  to  the  various  modes 
of  the  microstrip  line.  Direct  spectral  decomposition  of  the  matrix  is  expensive  for  complicated  waveguiding 
structures  with  large  matrices.  However,  by  exploiting  the  sparsity  of  the  matrices,  it  is  possible  to  solve  for  the 
equation  much  more  efficiently.  This  is  accomplished  using  the  bi-Lanczos  algorithm. 


2.  Theory 

The  problem  can  be  solved  in  terms  of  the  transverse  electric  field  components  or  the  transverse  magnetic 
field  components.  It  has  been  shown  [9]  that  both  formulations  are  equivalent.  The  space-time  dependence  of 
the  electromagnetic  field  is  assumed  to  be  exp[i(&*z  —  wi)]. 

2.1 .  Finite  Difference  Formulation 

The  discretized  vector  wave  equations  can  be  derived  from  the  source  free  Maxwell’s  equations  and  is  given 
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294 


by  [10] 


V  X  pmJ+iV  xEm-  fi2emEm  =  0, 

(1) 

V  x  c^V  x  Hm+i  -  S22pm+iHm+i  =  0. 

(2) 

In  the  above  equation,  Vx  is  the  curl  operator  using  forward  difference  to  approximate  the  derivatives  while  Vx 
represents  the  curl  operator  using  backward  difference  to  approximate  the  derivatives.  Similarly,  Em  is  the  fore- 
vector  associated  with  the  point  (m,  n)  while  Hm+i  is  the  back-vector  associated  with  the  point  (m  +  n  +  5). 
This  causes  the  electric  and  the  magnetic  field  components  to  be  defined  over  a  staggered  grid. 

Matching  the  transverse  field  components  of  the  vector  wave  equations  and  eliminating  the  z  components, 
we  get  [9] 

pmV,  x  xEJ,-  V^V,  ■  -  k2Wm  +  k2Esm  =  0,  (3) 

««+iV.  x  tfV,  x  H^+i  -  •  pro+iH;+i  -  k2 +  k2Wm+ ,  =  0.  (4) 

Here,  the  term  E^  represents  the  transverse  component  of  the  electric  field.  Figure  1  describes  the  positioning 
of  the  transverse  components  of  the  electric  field  on  the  xy  plane.  The  dielectric  parameters,  p  and  e,  are  defined 
at  half  grid  points  as  well  as  the  integer  grid  points  to  minimize  the  discretization  error.  The  source  term  is 
assumed  to  be  localized  on  the  z  =  z'  plane  and  represents  the  current  element  used  to  excite  the  modes  in  the 
microstrip  line. 


Figure  1.  Location  of  the  Ex  and  Ey  points  around  the  conducting  strip. 

Equations  (3)  and  (4)  can  be  expressed  in  the  form  of  an  eigenvalue  problem 

£e  ■  E,  +  =  0,  (5) 

£h  ■  H,  +  k2 Hs  =  0.  (6) 

Both  matrices,  £e  and  £h  are  asymmetric  and  extremely  sparse.  Each  eigenvector  corresponds  to  the  transverse 
field  components  for  that  mode  while  each  eigenvalue  represents  k2  for  that  particular  mode.  The  bi-Lanczos 

algorithm  is  used  to  solve  for  the  eigenpairs  of  the  matrices.  Each  iteration  in  the  bi-Lanczos  algorithm  requires 

two  matrix  vector  multiplies.  One  computes  £e  ■  v  while  the  other  evaluates  £e  •  w.  It  is  straightforward  to 
compute  the  former  since  we  know  the  exact  definition  for  the  operator  £e.  In  order  to  compute  the  latter 
without  writing  out  the  entire  matrix,  we  seek  a  physical  meaning  to  £t  .  The  fact  that  £e  and  £h  share  the 
same  eigenvalues  suggests  that  the  latter  might  be  related  in  some  form  to  £e  .  This  is  indeed  the  case  and  it 
can  be  shown  that  [8], 

£Tt  -  -z  x  £h  ■  z  x  (7) 
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2.2.  Boundary  Conditions 

To  model  an  open  waveguide  structure  like  the  microstrip  line  on  a  finite  computational  domain,  we  need 
to  surround  the  microstrip  line  with  a  metallic  shielding.  The  boundary  conditions  satisfied  by  the  transverse 
electric  field  components  at  the  walls  of  a  rectangular  waveguide  of  width  a  and  height  b  are 


Ex  -  0 

^=0 

dy 


x  =  0,x  =  a 


dx  ~  \y  =  0,y  =  b  (8b) 

Ey  =  0j 

Since  the  formulation  is  valid  for  an  inhomogeneous  substrate,  it  is  possible  to  model  the  conducting  strip  as 
a  part  of  the  substrate  with  high  conductivity.  However,  this  is  not  desirable  since  it  leads  to  an  ill-conditioned 
matrix  with  slow  convergence  in  the  bi-Lanczos  method.  The  more  efficient  way  is  to  model  the  conducting  strip 
using  boundary  conditions.  Figure  1  shows  the  location  of  the  transverse  field  components  around  the  conducting 
strip.  The  conducting  strip  should  be  defined  on  the  plane  where  the  Ex  points  are  defined  and  should  end  at 
the  midpoint  between  two  adjacent  Ex  points.  This  corresponds  to  the  location  where  the  Ez  component  is 
defined.  It  has  been  shown  [11]  that  the  electric  field  component  parallel  to  a  conducting  edge  is  always  regular. 
By  avoiding  the  computation  of  the  Et  component  at  the  edge  of  the  conducting  strip,  we  can  keep  the  field 
finite  throughout  the  computational  domain. 

In  order  to  model  the  conducting  strip  using  boundary  conditions,  we  set  the  tangential  electric  field  to  be 
zero  on  the  surface  of  the  conducting  strip.  This  implies  that  all  Ex  components  which  fall  within  the  conducting 
strip  will  be  set  to  zero.  To  impose  the  boundary  conditions  on  the  field  points  immediately  surrounding  the 
conducting  strip,  we  make  use  of  the  condition  that  Ez  should  be  zero  on  the  conducting  strip.  Since  the  electric 
field  satisfies  the  divergence  free  condition  on  the  area  just  outside  the  conducting  strip,  the  Ez  component  can 
be  expressed  in  terms  of  the  transverse  field  components  as  shown  below. 

ikz  E^  =  e“1Vs-eInE^.  (9) 

While  evaluating  the  field  at  the  points  immediately  surrounding  the  conducting  strip,  the  Ez  term  makes  its 
presence  felt  through  the  second  term  in  Eq.  (3).  The  second  term  can  be  written  in  terms  of  Ez  as  shown 

belOT'  *  =,  ..  r.  - e'\  , , ,,m 


*•«=?*■  ■«•■*■ = *  i1 1  a*  ■ 

By  setting  the  Ez  components  defined  on  the  surface  of  the  conducting  strip  to  zero,  it  is  possible  to  simulate  the 
presence  of  the  conducting  strip.  It  is  straightforward  to  extend  the  boundary  conditions  to  model  a  conducting 
strip  which  is  not  infinitesimally  thin.  In  that  case,  one  needs  to  ensure  that  all  four  edges  of  the  conducting 
strip  end  on  Ez  points. 

2.3.  Localized  Current  Source  Response 

In  the  presence  of  a  localized  current  source,  J  at  z  -  z' ,  equations  (5)  and  (6)  become 

Z,E,  +  pE  ,=.,«(* -A  <n> 

Za-H,  + J^H,  =  (12) 

In  the  above  equation,  the  2  dependence  of  the  fields  has  been  supressed  and  se  and  sh  are  defined  as 

se  =  Xly. mJm  ~  Vs  ■  J^, 
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(14) 


Sh  —  Z  X  fjj^. 

The  generalized  formal  solution  to  the  sparse  matrix  equations  in  terms  of  matrix  functions  is 

%  =  ^£7=  (15) 

(16) 

To  solve  for  the  fields  using  equations  (15)  and  (16),  one  needs  to  evaluate  matrix  functions  of  the  form /(£g)-se. 
A  Krylov  subspace  based  method  is  the  ideal  choice  for  solving  such  sparse  matrix  functions  [12].  We  use  the 
bi-Lanczos  algorithm  [13]  to  solve  the  asymmetric  matrix  equation.  The  bi-Lanczos  algorithm  approximates  the 
original  matrix  Ce  of  size  N  x  N  with  a  smaller  tri-diagonal  matrix  of  size  M  x  M.  Also  generated  during  the 
bi-Lanczos  algorithm  are  two  sets  of  iteration  vectors,  V  and  W,  each  of  size  M  x  N.  The  relation  between  the 
iteration  vectors  and  the  matrices  can  be  summarized  as  follows. 


Ce  -  V  =  V  -  T 

(17) 

cl  -w  =  w  -tt 

(18) 

The  iteration  vectors,  V  and  W,  in  exact  arithmetic  are  bi-orthogonal  to  each  other.  In  practice,  however  they 
lose  their  orthogonality  after  a  few  iterations.  Another  alternative  is  to  use  the  more  robust  Arnoldi  method. 
The  drawback  with  the  Arnoldi  method  is  that  it  becomes  prohibitively  expensive  for  large  matrices  since  it 
performs  explicit  reorthogonalization  at  each  iteration. 

It  can  be  shown  that  any  matrix  function  of  the  form  /(£«)  •  sg  can  be  evaluated  as  [8] 

/(£«)  •  Vi  =  V  *  Q  •  /(A)  •  Q-i  ■  ei .  (19) 

Here  Q  represents  the  matrix  containing  the  eigenvectors  of  the  tridiagonal  matrix,  T  and  A  is  the  diagonal 
matrix  containing  the  eigenvalues  of  T.  ej,  is  the  first  unit-M  vector.  The  vector  se  is  chosen  as  the  starting  vector, 
vx  in  the  bi-Lanczos  iterations.  Thus  by  carrying  out  the  bi-Lanczos  algorithm  and  the  spectral  decomposition 
of  the  resulting  tridiagonal  matrix,  T,  we  can  use  Eq.  (19)  to  solve  the  sparse  matrix  equation.  The  solution  to 
Eq.  (15)  gives  the  field  components  at  any  given  z  plane  for  an  arbitrary  localized  current  source  at  z  —  z' .  As  a 
by-product  of  evaluating  the  current  source  response,  it  is  also  possible  to  solve  for  the  propagating  modes.  The 
eigenvalues  of  the  matrix  T  are  the  same  as  the  eigenvalues  of  the  matrix  £g.  Since  these  eigenvalues  represent 
Jb|  for  that  particular  mode,  it  is  possible  to  isolate  the  eigenpairs  (Aj, q*)  of  the  tridiagonal  matrix  T,  which 
correspond  to  the  propagating  modes.  From  these  eigenpairs,  it  is  possible  to  obtain  the  eigenpairs  of  the  Ct 
matrix,  (A,,  V  ■  q*).  These  eigenpairs  contain  all  information  that  is  required  about  the  propagating  modes. 

2.4.  Complexity  and  Storage  Issues 

The  two  main  steps  in  the  algorithm  are  the  bi-Lanczos  iterations  and  the  spectral  decomposition  of  the 
tri-diagonal  matrix.  Each  bi-Lanczos  iteration  has  two  matrix-vector  multiplies.  Since  the  matrices  are  sparse, 
the  matrix-vector  multiply  is  an  O(N)  operation.  The  number  of  bi-Lanczos  iterations  needed  to  propagate 
information  transversely  across  the  grid  in  the  simulation  domain  is  observed  empirically  to  scale  as  y/N.  It  should 
be  noted  that  apart  from  the  number  of  unknowns,  the  relative  permittivity  of  the  various  substrates  also  play  a 
role  in  determining  the  number  of  iterations  required  for  convergence.  This  is  because  of  the  multiple-scattering 
that  ensues  and  directly  affects  the  condition  number  of  the  matrix.  Another  factor  which  might  influence  the 
convergence  is  the  number  of  conducting  strips.  Since,  each  conducting  strip  has  a  singularity  associated  with 
its  edges,  the  more  the  number  of  conducting  strips,  the  higher  the  condition  number  of  the  matrix.  The  overall 
complexity  of  the  bi-Lanczos  algorithm  is  0(N15).  The  complexity  of  the  spectral  decomposition  procedure  is 
0(N 1,s)  since  the  size  of  the  matrix  T  scales  as  y/N .  If  it  is  desired  to  solve  for  just  the  propagating  modes,  it 
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is  possible  to  solve  for  just  those  eigenpairs  instead  of  carrying  out  the  whole  spectral  decomposition.  This  will 
make  the  algorithm  faster  but  will  not  affect  the  overall  complexity. 

The  iteration  vectors,  V  and  W  are  the  primary  bottleneck  in  limiting  the  storage  requirement.  Since  the 
iteration  vectors  W  are  not  needed  to  compute  the  field,  they  can  be  discarded  as  they  are  generated.  On  the 
other  hand,  one  needs  to  store  V  in  order  to  compute  the  current  response  or  to  compute  the  field  distribution 
for  the  propagating  modes.  Hence  the  storage  requirements  scale  as  0(Nir°)  which  is  not  desirable  for  solving 
large  problems.  Fortunately,  it  is  possible  to  circumvent  the  storage  bottleneck  as  discussed  below. 

In  order  to  evaluate  f(Ce)  -  vj  or  the  eigenvector  corresponding  to  the  propagating  modes,  we  need  to 
evaluate  a  matrix  equation  of  the  form  V  •  q,  where  q  is  a  vector  of  size  M .  During  the  bi-Lanczos  iterations, 
the  vectors  v;  are  evaluated  using  the  recursive  relation, 

vt+1  =  [£e  •  Vi  -  o(i')v{  -  7(1  -  l)vf_i]//?(i).  (20) 

In  the  above  expression  a,  /3  and  7  are  the  diagonal  and  sub-diagonal  elements  of  the  matrix  T.  If  we  store 
the  starting  vector,  vi,  it  is  possible  to  recursively  obtain  the  subsequent Jiteration  vectors  using  Eq.  (20).  As 
each  vector  Vi  is  computed,  its  contribution  to  the  matrix- vector  product  V  q  is  evaluated  and  then  the  vector 
is  discarded.  This  process  of  regenerating  the  iteration  vectors  takes  less  than  half  the  time  required  for  the 
bi-Lanczos  iterations  because  it  needs  only  one  matrix  vector  multiply  and  and  the  terms  a,  0  and  7  need  not 
be  recomputed.  This  increases  the  overall  computational  time  of  the  algorithm  by  roughly  10%  but  reduces  the 
storage  to  a  small  fraction  of  the  original  requirements.  For  small  problems  it  is  feasible  to  store  the  iteration 
vectors,  but  for  larger  problems  it  is  imperative  to  discard  and  regenerate  the  iteration  vectors. 


Cover  Plane 


(a) 


(t») 

Figure  2.  (a)  Wide  microstrip  line  structure,  ei  =  9.7eo>  w  —9-15  mm,  h  =1.92  mm,  d  =0-64  mm.  (b)  Dispersion 
curves  for  the  structure  shown  in  Fig.  2(a). 
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3.  Numerical  Results 


To  study  the  performance  of  the  numerical  technique  described  in  this  paper,  we  consider  three  types 
of  microstrip  structures.  The  first  structure  analyzed  is  a  simple  wide  microstrip  line  shown  in  Figure  2(a). 
Dispersion  curves  for  the  first  three  modes  are  shown  in  Figure  2(b).  The  results  are  compared  with  those 
published  by  Jansen  [14].  These  results  were  confirmed  later  using  an  analaytic  technique  in  [15].  Very  good 
agreement  is  seen  between  the  two  results. 

The  next  example  is  that  of  a  three  conductor  stripline  structure  shown  in  Figure  3(a).  There  are  three 
distinct  modes  associated  with  this  structure.  Figure  3(b)  plots  the  dispersion  curves  for  these  three  modes.  The 
results  are  compared  with  those  published  by  Yang  et  al  [4].  Once  again,  very  good  agreement  is  seen  between 
the  two  results. 
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Figure  3.  (a)  Three  conductor  stripline  structure,  ei  =  €3  =  9.7e0,  £3  —  4e0,  w/h  =  1.0,  s/h  =  0.1, 
d  =  h.  (b)  Dispersion  curves  for  the  structure  shown  in  Fig.  3(a). 

In  our  final  example  we  consider  a  microstrip  line  on  top  of  a  laterally  segmented  substrate  as  shown  in 
Figure  4(a).  This  example  was  chosen  to  show  the  ability  of  this  technique  to  handle  lateral  inhomogeneities. 
Dispersion  curves  are  plotted  in  Figure  4(b)  for  the  fundamental  mode  at  various  values  of  6.  The  structure 
reduces  to  a  simple  homogeneous  substrate  for  values  of  b  =0  mm  and  b  =2.5  mm.  The  results  are  compared 
with  those  published  by  Kiang  [6].  Excellent  agreement  is  seen  between  the  two  results. 


299 


I  1 .  :  -  :i  II 

U - 1, - .4* - a-2b - 4- - b - 4 

(a) 
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Figure  3.  (a)  Microstrip  line  on  a  segmented  substrate,  a  =5  mm,  w  =1  mm,  d  =1  mm.  (b)  Dispersion 
curves  for  the  structure  shown  in  Fig.  4(a). 


4.  Conclusions 

We  have  developed  an  efficient  algorithm  to  solve  the  generalized  microstrip  line  problem  with  multiple 
conductors  on  an  inhomogeneous  substrate.  The  strength  of  this  technique  is  that  the  same  algorithm  can  be 
used  to  analyze  different  microstrip  structures  by  feeding  in  the  permittivity  profile  of  the  substrate  and  the  size 
and  location  of  the  conducting  strips.  This  method  can  also  be  expanded  to  include  anisotropic  substrates  as  in 
[9].  The  numerical  results  obtained  using  this  technique  agree  well  with  previously  published  results.  A  possible 
area  for  future  work  is  the  use  of  PML  to  truncate  the  computational  domain.  This  will  help  reduce  the  number 
of  unknowns  in  open  microstrip  structures.  Another  way  to  accomplish  this  by  using  a  subgridding  technique 
which  will  also  help  model  complex  geometries  accurately. 
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Abstract 

The  hybrid  finite  element-boundary  integral  (FE-BI)  method  has  been  quite  popular  due  to  its 
adaptability  in  modeling  arbitrarily  shaped  objects  involving  complex  materials.  However,  being  a 
frequency  domain  analysis,  the  FE-BI  method  may  not  be  appealing  for  broadband  frequency-dependent 
data  generation.  Also,  it  is  unavoidable  to  resolve  the  hybrid  system  for  each  excitation  (incidence  angle) 
when  an  iterative  solver  is  employed.  Subsequently,  monostatic  radar  cross  section  (RCS)  calculations  are 
computationally  intensive,  and  therefore  asymptotic  waveform  evaluation  (AWE)  as  an  extrapolation 
method  becomes  attractive  for  CPU  time  reduction.  In  this  paper,  we  present  frequency  and  angular 
extrapolation  implementations  in  FE-BI  systems.  Scattering  applications  of  the  hybrid  technique  in 
conjunction  with  AWE  will  be  presented  to  demonstrate  the  performance  of  these  implementations  in  terms 
of  accuracy  and  computational  efficiency  . 


1  Introduction 

Hybrid  FE-BI  methods  have  gained  wide  acceptance  because  they  combine  the  rigor  of  the  integral  equation 
for  truncating  the  mesh  with  the  material  generality  and  geometrical  flexibility  of  the  finite  element  method  [1], 
In  these  methods,  a  matrix  system  is  formed  and  solved  to  obtain  the  desired  parameters  by  using  a  direct  or  an 
iterative  solver.  For  electrically  large  problems,  however,  the  solution  of  the  matrix  system  is  computationally 
intensive  and  must  be  repeated  at  finer  frequency  increments.  In  addition,  the  solution  must  be  independently 
carried  out  for  each  angle  of  incidence  in  case  of  an  iterative  solver.  Hence,  monostatic  RCS  calculations  are 
computationally  intensive;  therefore  AWE  or  other  types  of  extrapolation  methods  become  attractive  for  CPU 
time  reduction. 

The  AWE  method  provides  a  reduced-order  model  of  a  linear  system  that  has  been  successfully  used  in 
various  electromagnetic  and  circuit  applications  [2]-[7].  It  is  worthy  to  note  that  every  aspect  of  the  method  of 
model-based  parameter  estimation  (MBPE),  which  is  identical  to  AWE  in  nature,  has  recently  been  reviewed  by 
Miller  [8]-[10].  Furthermore,  Miller  introduced  an  MBPE  based  approach  called  windowed-adaptive-sampling 
pattern  estimation  (WASPE)  to  obtain  radiation  and  scattering  patterns  [1 1].  In  particular,  the  implementation  of 
the  AWE  technique  in  conjunction  with  the  method  of  moments  (MoM)  and  the  hybrid  FE-BI  method  has  also 
appeared  in  the  recent  literature  [12] -[14].  These  implementations  dealt  with  accurate  and  efficient  frequency- 
dependent  data  estimations.  Also,  a  pattern  fill  implementation  for  MoM  systems  has  been  introduced  in  [14]  as 
an  application  of  angular  extrapolation. 

In  this  paper,  we  first  present  a  brief  review  of  formulation  for  frequency  and  angular  extrapolations  applied  in 
FE-BI  and  MoM  systems,  respectively,  given  in  [14].  We  will  then  demonstrate  the  efficiency  and  accuracy 
afforded  by  these  implementations  for  3D  scattering  problems. 
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2  Frequency  Extrapolation  for  FE-BI  Systems 

On  the  basis  of  AWE,  a  Taylor  series  expansion  of  the  matrix  system  (MoM  or  FE-BI)  is  generated  about  a 
specific  value  of  the  system  parameter  (frequency  or  angle).  The  expansion  coefficients  or  moments  are  then 
used  to  extract  poles  of  the  system,  yielding  a  rational  function  (Pade  approximation)  of  the  system  parameter. 
Pade  representations  have  a  larger  circle  of  convergence  and  can  therefore  provide  a  broader  extrapolation 
because  they  include  not  only  zeros  but  also  poles  of  the  response.  This  representation  results  in  an  extension  of 
the  region  of  convergence  (RoC)  of  the  power  series,  thus  increasing  the  accuracy  of  the  analysis  to  a  wider 
range.  In  this  section,  we  will  describe  the  implementation  of  AWE  into  FE-BI  systems  for  frequency 
extrapolation. 

In  accordance  with  the  FE-BI  procedure  [15],  cavity-backed  structures  are  formulated  using  the  finite  element 
method  for  the  fields  within  the  cavity  and  the  boundary  integral  for  mesh  truncation  across  the  aperture.  With 
frequency  /as  the  parameter  of  interest  for  extrapolation,  discretization  of  the  field  equations  using  tetrahedron  in 
the  volume  and  triangles  on  the  aperture  [1]  in  conjunction  with  Galerkin's  method  yields  the  system 

&u*>]  {«.(*)>={*.(*»  a) 

where  [Am„(k)\  is  a  partly  sparse,  partly  dense  complex  symmetric  matrix,  {gm(£)}  is  the  excitation  vector,  {e„(k)} 
is  the  unknown  electric  field  coefficient  vector,  and  k  is  the  free  space  wave  number  representing  the  frequency 
dependence  of  the  system.  For  simple  media,  [Amn(k)\  is  a  linear  combination  of  matrices  with  k  dependencies  in 
the  form  of  it2  (due  to  FE-part),  e'jk  and  JfcV*  (due  to  Bl-part)  [14].  However,  this  dependence  may  become  more 
complicated  for  frequency-dependent  anisotropic  media.  For  instance,  depending  on  the  direction  of  applied  dc 
bias  field,  corresponding  entries  in  permeability  tensor  of  a  ferrite  material  become  rational  functions  in 
frequency.  In  addition,  the  ^-dependence  of  gm(k)  is  determined  by  the  type  of  excitation  being  applied.  In  case 
of  a  plane  wave  excitation,  it  takes  the  form  of  ke?.  On  the  other  hand,  for  a  radiation  problem,  this  form  is 
specified  by  the  type  of  the  feed  structure  being  used.  For  instance,  g(k)  is  a  linear  function  of  k  for  a  probe- 
current  feeding,  whereas  it  is  a  function  of  keik  for  a  coaxial  line  feed  structure  [14]. 

To  apply  AWE  to  the  system  (1),  we  begin  by  expanding  the  unknown  field  vector  e(k)  at  a  specific  value  of 
k=k0as 

{*„(*)}  “  -kj,  {Ml)  =  {*„(*„))  (2) 

where  the  moments  {Msn}  for  s>0  given  by 

k?<*o)]Vr,j M  (3) 

Here  [/fffln(Ab)]'‘  is  the  inverse  of  [4m„(£0)3  and  Am„(k0)  is  the  derivative  of  Amn{k)  with  respect  to  k  evaluated  at 
k0.  Similarly,  gJs\ko)  is  the  5th  derivative  of  gjjc)  with  respect  to  k  evaluated  at  k0.  Once  the  AWE  moments  are 
obtained,  the  electric  field  coefficients  at  frequencies  around  the  expansion  frequency  can  be  calculated  via  (2). 
However,  the  accuracy  of  (2)  quickly  deteriorates  as  k  moves  away  from  k0.  To  extend  the  utility  of  the 
expansion,  its  Pade  equivalent  can  in  turn  be  used  to  obtain  the  frequency  response  [12]-[14], 

The  basic  starting  point  for  the  Pade  approximation  is  to  obtain  the  Taylor  series  expansion  of  a  given 
function  about  a  reference  point.  Thus,  each  entry  of  the  vector  {e„(£)}  in  (2)  is  considered  as  a  complex  function 
of  k  and  can  be  expanded  as 

«.(*>= <4> 

Using  this  notation,  the  Pade  representation  for  each  e„(k)  is  then  given  by 

pn(UM) = -*o)"j  <5) 
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where  the  coefficients  a[  and  b"  are  obtained  by  satisfying  the  equality 

TloM^k-koy  =  P„  (L /M)  +  0(kL+M+i )  (6) 

The  integers  I  and  Mare  the  orders  of  the  zero  and  pole  expansions,  respectively.  The  coefficients  a[  and  b™ 
are  then  found  from 

Y'  Ml~Jb}  =  -M‘n  ,  L  +  l<l <L+M  ,  l>M:b}„=  0  (7) 

X-rf/ss  1  n  n  n 

a\  =  M°  ,  tl,  +  .  IS  1ZL  (8) 

To  demonstrate  the  frequency  extrapolation  implementation  described  above,  we  consider  the  scattering  by  a 
ferrite  loaded  cavity-backed  aperture  [16]  as  shown  in  Fig.  1.  Ay-directed  dc  biasing  of  Ho=8000e  is  applied  in 
the  ferrite  layers  with  a  magnetization  of  47iMs=445G.  The  discretization  of  the  cavity  volume  resulted  in  4182 
unknowns  of  which  133  were  on  the  aperture.  In  Fig.  1,  the  RCS  response  in  the  frequency  band  of  0.7-1  GHz  is 
displayed  for  exact  FE-BI  and  Pade  approximations  with  Z,=M=  2  for  two  different  expansion  points,  7o=0.8GHz 
and/0=0.9GHz.  As  seen,  both  AWE  results  agree  with  the  exact  solution  quite  well.  In  particular,  during  the 
analysis  of  this  problem,  it  was  observed  that  the  resonance  frequency  (^=0.85GHz)  is  captured  by  AWE 


Frequency  (GHz) 

Figure  1.  Monostatic  RCS  (normal  incidence:  ^=^=0°)  frequency  response  of  a  ferrite  loaded  cavity  (di  =0.726, 
d2= 1.790,  d3=0.737,  d,=0.762,  ds=1.065  in  cm):  FE-BI  (exact)  vs.  AWE  (approximate). 

solutions  with  a  reasonable  accuracy  regardless  of  the  choice  of  expansion  point.  For  CPU  time  comparisons,  we 
used  31  frequency  points  for  a  reasonable  approximation  of  the  RCS  response  using  the  FE-BI.  The 
corresponding  CPU  time  was  1  hour  20  minutes.  On  the  other  hand,  Pade  (2/2)  approximation  was  carried  out  in 
30  minutes  which  is  at  least  half  of  the  CPU  time  consumed  by  the  exact  solution.  Clearly,  AWE  provides  a 
noticeable  improvement  in  CPU  time,  as  expected.  It  is  important  to  note  that  the  FE-BI  solution  was  obtained 
using  an  iterative  solver,  but  the  AWE  solutions  were  produced  by  employing  a  direct  solver  because  of  a  need  to 
use  the  inverse  of  the  system  matrix  every  time  an  AWE  moment  is  generated  (see  Eqn.  3).  When  the  FE-BI 
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solution  is  also  obtained  via  a  direct  solver,  the  CPU  time  reduction  with  AWE  becomes  outstanding  with  a  value 
of  95%. 

3  Pattern  Fill  Implementation  in  MoM  Systems 

MoM  has  been  a  popular  tool  for  accurate  prediction  of  RCS  calculations  [17].  Its  implementation  in 
connection  with  the  electric  field  integral  equation  (EFIE)  involves  a  solution  of  the  electric  current  surface 
density  using  a  direct  or  some  iterative  solver.  With  incidence  angle  9  as  the  parameter  of  interest  for 
extrapolation  (at  a  fixed  <£-cut  and  for  a  specific  frequency),  a  usual  form  of  the  MoM  system  is  given  by 

where  [Zmn]  is  a  full  complex  matrix  which  is  independent  of  incidence  angle  in  nature,  I„  refer  to  the  unknown 
current  density  coefficients,  and  {Vm(9)}  is  the  excitation  vector.  It  is  apparent  that  only  dependence  on  0is  due 
to  the  excitation,  and  for  a  plane  wave  incidence,  Vm(9)  becomes  a  function  of  g^eism0+ctem0'>  ^  which  C\  and  c2 
are  constant  terms  with  respect  to  9. 

For  monostatic  pattern  evaluations  using  a  direct  solver  it  is  necessary  to  carry  out  the  matrix  vector  product 
\Zmn]'] {Vm(9)}  for  each  excitation.  On  the  other  hand,  for  iterative  solvers,  the  entire  solution  must  be  repeated 
for  each  incidence  angle.  AWE  can  again  be  employed  to  eliminate  repetition  of  the  iterative  solutions  altogether 
except  for  a  few  pattern  points  used  as  the  expansion  points  of  the  angular  extrapolation. 

To  show  how  AWE  can  be  applied,  we  assume  that  {1„(9)}  has  already  been  computed  at  a  given  direction 
(#),#>)■  A  Taylor  series  expansion  on  £(with  ^  constant)  is  then  given  by 


{/,(«)}= SI  k  -».)*.  k°)=  ft.®.)} 

(10) 

(11) 

where  V^(0o)is  the  s*  derivative  of  Vm{9)  with  respect  to  devaluated  at  $>.  For  plane  wave  excitations,  the 
moments  {A/*}  can  be  trivially  calculated,  and  one  could  therefore  increase  the  order  of  the  expansion  as  needed 
to  extend  the  validity  of  the  approximation.  This  is  better  achieved  by  casting  (10)-(1 1)  into  a  Pade  rational 
function.  In  this  case  equations  (4)-(6)  are  applicable  provided  e„,  k,  and  k0  are  replaced  by  1„,  9  and  6b, 
respectively.  We  also  note  that  the  angular  extrapolation  described  here  is  also  applicable  for  FE-BI  systems  in  a 
similar  manner. 

To  demonstrate  the  proposed  pattern  fill  procedure,  we  consider  the  scattering  by  a  metallic  plate  {TkxZk) 
with  three  circular  holes  as  shown  in  Fig.  2.  In  this  case,  the  resulting  full  matrix  size  was  649x649.  The 
incident  electric  field  is  given  by 

Eiric(0)  =  yeMxsiD*-2COse)  (12) 

i.e.  <f>  =  0°.  The  monostatic  RCS  pattern  was  first  computed  at  d=14°,  38°,  58°,  75°  using  an  iterative  MoM 
code  and  these  points  were  then  used  as  expansion  points  to  generate  Pade  approximations  with  1=3,  M=\  and 
L=M=  1.  As  seen  in  Fig.  2,  each  expansion  best  approximates  the  reference  RCS  pattern  in  Ihe  corresponding 
angular  sector.  In  proving  CPU  time  speed-ups,  we  used  91  MoM  points  as  reference.  The  corresponding  CPU 
time  was  1  hour  37  minutes.  On  the  other  hand,  the  multi-point  Pade  approximation  at  181  points  was  carried 
out  in  25  minutes  which  is  about  a  quarter  of  the  CPU  time  used  by  the  MoM.  This  comparison  clearly  shows 
how  monostatic  RCS  patterns  can  be  obtained  much  fester  by  means  of  AWE.  Of  course,  as  the  number  of 
angular  (or  frequency)  points  in  the  reference  solution  increases,  AWE  becomes  more  attractive  in  terms  of  CPU 
cost.  As  a  result  of  a  systematic  study  for  AWE  order  vs.  the  angular  band  of  the  approximation,  we  observed 
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that  a  Pade  expansion  with  1=1,  M=  1  or  Af=0  results  in  a  sufficiently  accurate  approximation  if  one  expansion 
point  is  chosen  for  every  angular  sector  with  a  single  lobe/null. 


0  (degree) 


Figure  2.  Monostatic  RCS  pattern  of  a  square  PEC  plate  with  three  circular  holes  (n  =0.1 5 A,,  r2= 0.25X, 
r3= 0.7X):  MoM  (exact)  vs.  AWE  (approximate). 

4  Conclusion 

In  this  paper.  We  presented  frequency  and  angular  extrapolation  implementations  in  hybrid  FE-BI  systems  for 
generating  broadband  RCS  patterns  using  only  a  few  points  of  the  exact  solution.  Once  again,  it  was 
demonstrated  that  AWE  is  a  fast  and  accurate  (within  a  bandwidth  of  the  expansion  point)  model  order  reduction 
technique  We  should  point  out  that  the  accuracy  and  extrapolation  range  of  the  AWE  implementation  depend 
upon  several  factors  such  as  pattern  shape,  location  of  the  expansion  points  and  the  order  of  the  Fade 
representation.  If  these  design  parameters  are  addressed  properly,  AWE  proves  to  be  an  alternative  method  for 
the  solution  of  scattering  as  well  as  radiation  problems  in  frequency  domain  electromagnetics. 
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1.  Introduction 

In  a  previous  paper  [1]  we  presented  a  reduced-order  modeling  technique  for  the  computation  of 
two-dimensional  transient  electromagnetic  wave  fields  in  complex  configurations.  Here  we  extent 
the  technique  to  the  three-dimensional  case.  Maxwell’s  equations  are  taken  as  a  starting  point. 
These  equations  form  a  hyperbolic  system  of  coupled  first-order  partial  differential  equations  and 
are  written  in  a  particular  matrix  operator  form.  We  then  discretize  these  equations  in  space 
and  in  such  a  way  that  the  structure  of  Maxwell’s  equations  is  preserved.  The  solution  of  the 
resulting  algebraic  matrix  equation  gives  a  numerical  approximation  of  the  electromagnetic  held 
quantities  and  can  be  written  in  terms  of  a  matrix  exponential  function  operating  on  a  source 
vector.  The  argument  of  the  exponential  function  consists  of  a  product  of  a  large  and  sparse 
matrix  and  the  continuous  time  coordinate.  The  problem  with  the  expression  for  the  solution 
vector  is  that  standard  decomposition  techniques  for  evaluating  the  matrix  exponential  function 
cannot  be  applied  since  the  order  of  the  relevant  matrix  is  simply  too  large.  However  we  do  know 
that  this  matrix  has  certain  symmetry  properties  because  the  structure  of  Maxwell's  equations 
has  been  preserved.  We  make  use  of  these  symmetry  properties  in  a  modified  Lanczos  algorithm. 
This  algorithm  allows  us  to  construct  approximations  to  the  solution  vector  without  having  to 
discretize  the  time  variable.  Computing  such  an  approximation  again  requires  the  evaluation  of 
a  matrix  exponential  function,  but  this  time  we  need  to  compute  the  exponential  of  a  tndiagonal 
matrix.  This  matrix  is  generated  by  the  Lanczos  algorithm  and  its  order  is  much  smaller  than 
the  order  of  the  matrices  that  result  from  discretizing  Maxwell’s  equations  directly.  For  this 
reason,  we  call  the  approximations  reduced-order  models.  Numerical  results,  for  three-dimensional 
configurations,  illustrate  the  performance  of  the  technique. 

2.  Maxwell’s  equations 

Maxwell’s  equations,  governing  the  pointwise  behavior  of  the  electromagnetic  field  present  in  an 
inhomogeneous,  isotropic  and  lossy  medium,  can  be  written  in  the  form 


(V  +  Mi  +  M2dt)IF  =  Q'.  (1) 

In  this  equation,  T  =  T{x,  t )  is  the  field  vector  consisting  of  the  components  of  the  electric  field 
strength  E  and  the  magnetic  field  strength  H  as 

T  =  [Elt  E2,  E3,  Hx,  H2,  H3)t,  (2) 


308 


and  Q!  =  Q'{x,  t )  is  the  source  vector  composed  of  the  components  of  the  external  electric-current 
sources  Je  and  the  external  magnetic-current  sources  Ke  as 

a  =  -{JlJt,JI,Ki,KlKlf.  (3) 


The  spatial  differentiation  operator  matrix  V  is  given  by 

/  0  0  0  0  dz  ~d2  \ 

0  0  0  -dz  0  dx 

0  0  0  d2  -di  0 

0  -dz  d2  0  0  0 

03  0  -3i  0  0  0 

V  *"^2  di  o  o  o  o  / 

and  satisfies  the  anti-commutation  relation 

V8~  =  -5~V, 

where  we  have  introduced  the  signature  matrix  8~  as 

<T  =  diag(l,l,  1,-1, -1,-1). 

We  also  introduce  the  identity  matrix  5+  as 

<5+  =  diag(l,  1, 1,  l,.l,  1), 

and  the  matrices  <5E  and  8K  as 

<5E  =  \{5+  +  8~ )  =  diag(l,  1, 1, 0, 0, 0), 


(4) 

(5) 

(6) 

(7) 

(8) 
(9) 


and 

5n  =  l-(S*  -  S')  =  diag(0, 0, 0, 1, 1, 1). 


(10) 


The  anti-commutation  relation  of  Eq.  (5)  can  then  be  rewritten  in  terms  of  the  matrices  8E  and 
<5H  as 


V8E  =  8nD, 


(11) 


or  as 


V8n  =  8EV.  (12) 

Equations  (11)  and  (12)  show  that  if  matrix  V  operates  on  a  vector  related  to  the  electric  field 
strength,  a  vector  related  to  the  magnetic  field  strength  results  and  vice  versa. 

The  time-independent  matrices  and  At 2  are  medium  matrices  given  by 


Mi  =  diag(cr,  <7,  <7, 0, 0, 0), 


(13) 
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and 

M2  =  diag(e,  e,  e,  p,  yu,  //). 

(14) 

Matrix  M\  is  positive  semidefinite  and  matrix  M2  is  positive  definite, 
relations 

These  matrices  satisfy  the 

II 

i 

•-o 

II 

1 

•o 

(15) 

and 

M28~  =  S~M2. 

(16) 

In  what  follows,  we  consider  source  vectors  of  the  form 

Q'(x,t)  =  w(t)Q(x), 

(17) 

where  w(t)  is  the  source  wavelet  that  vanishes  for  t<  0  and  Q  —  Q(x)  is  a  time-independent  vector. 
The  source  vector  is  said  to  be  of  the  electric-current  type  if  the  vector  Q  satisfies  Q  =  SEQ  and 
of  the  magnetic-current  type  if  this  vector  satisfies  Q  =  SHQ. 

As  a  next  step,  we  discretize  Maxwell’s  equations  in  space  and  in  such  a  way  that  Eqs.  (5), 
(15),  and  (16),  valid  in  the  continuous  context,  have  a  counterpart  after  discretization.  A  simple 
discretization  procedure  that  satisfies  this  requirement  is  the  standard  finite-difference  technique 
of  Yee  [2] .  We  employ  a  homogeneous  Dirichlet  boundary  condition  to  restrict  the  unbounded 
domain  of  computation  to  a  bounded  one.  The  discrete  counterparts  of  V,  A4i,  A42,  F,  and  2 
are  given  by  D,MUM2,F ,  and  Q,  respectively.  The  discrete  counterparts  of  the  matrices  d  , 
<5+,JE,  and  SH  are  denoted  by  the  same  symbols.  After  this  discretization  procedure  we  obtain 
the  algebraic  matrix  equation 

(D  +  Mi  +  M2dt)F(t)  =  w{t)Q.  (18) 

The  matrices  in  this  equation  are  all  square;  matrix  D  is  real  and  anti-symmetric;  and  the  medium 
matrices  Mx  and  M2  are  both  diagonal,  M1  being  positive  semidefinite  and  M2  being  positive 
definite.  Note  that  the  order  of  the  matrices  is  proportional  to  6N3,  where  N  is  the  number  of 
sample  points  in  each  Cartesian  direction. 

Solving  Eq.  (18)  for  the  field  vector  gives 

F(t)  =  w(t )  *  x(£)  exp (— A£)M2_1Q,  (!9) 


where  the  asterisk  denotes  convolution  in  time,  where  x{t)  is  the  Heaviside  unit  step  function, 
and  where  matrix  A  is  given  by 

A  =  +  (20) 


Now  let  u  be  a  vector  such  that  the  quantity 

uTF(t )  =  w{t)  *  x(t)uT  exp(-A£)M2“1Q  (21) 

is  a  component  of  the  electric  or  the  magnetic  field  strength  at  a  desired  observation  point.  If 
a  component  of  the  electric  field  strength  is  selected,  the  vector  u  satisfies  u  =  5  u,  and  if  a 
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component  of  the  magnetic  field  strength  is  selected  the  vector  u  satisfies  u  —  SHu.  For  a  source 
vector  of  the  electric-current  type  ( Q  —  5EQ),  we  then  have  the  following  reciprocity  relations: 

uTexp(-J4t)M2"1Q  =  QT  exp(-Af)M2-1u,  (22) 


for  u  =  6Eu,  and 


uT  exp(—At)M2  XQ  =  -QT  exp(-At)M2  x«,  (23) 

for  u  =  The  proof  of  these  relations  can  be  found  in  the  Appendix.  It  is  based  on  the 
observation  that  matrix  A  satisfies  the  relation 

AtM2S~  =  M28~A.  (24) 

For  the  sake  of  completeness  we  mention  that  similar  reciprocity  relations  hold  in  case  the  source 
vector  is  of  the  magnetic-current  type  (see  [3]). 

The  problem  with  the  expression  for  the  solution  vector  is  that  standard  methods  that  use  a  full 
decomposition  of  matrix  A  for  evaluating  the  matrix  exponential  function  cannot  be  used  because 
the  order  of  this  matrix  is  simply  too  large.  To  give  an  idea  of  its  size,  realistic  three-dimensional 
configurations  require  a  computational  grid  such  that  the  order  of  matrix  A  can  become  as  large 
as  10'.  Instead  of  computing  a  full  decomposition,  we  propose  to  calculate  a  partial  decomposition 
of  matrix  A  by  means  of  a  modified  Lanczos  algorithm.  Based  on  this  decomposition,  we  can 
construct  approximations  to  the  solution  vector  without  having  to  discretize  the  time  variable. 

3.  Reduced-order  models  for  the  electromagnetic  field  quantities 

Let  {•,  •)  denote  the  standard  inner  product  of  two  vectors  from  Rn.  By  using  the  reciprocity 
relation  of  Eq.  (24)  it  is  easily  verified  that  matrix  A  is  symmetric  with  respect  to  the  bilinear 
form  ( M28~  ■,  ■).  This  allows  us  to  carry  out  the  following  modified  Lanczos  algorithm: 

PiVi  =  M2~lQ 
Wi  =  Avt-fovi- 1 
a,  =  { M26~Vi,Wi ) 

Pi+lVi+l  -  Wi  -  OLiVi 

with  Vq  =  0.  The  algorithm  produces  coefficients  a*  and  pi  and  Lanczos  vectors  V{.  The  co¬ 
efficients  Pi  are  determined  from  the  condition  { M2S~Vi,Vi )  =  1,  for  i  >  1.  After  m  steps 
of  the  above  algorithm,  the  first  m  Lanczos  vectors  form  a  basis  of  the  Krylov  space  Km  = 
span{Af2-1<2>  AM2_1<3, Am-1M2-1Q},  and  the  summarizing  equation 

AVm  —  VmTm  -I-  pm+ivm+ien  (26) 


for  i  —  1,2,..., 


(25) 


holds.  In  this  equation,  matrix  Tm  is  a  complex,  symmetric,  and  tridiagonal  m-by-m  matrix  given 
by 


foci  p2  > 

P2  oc2  Pz 
Pz  •  • 

•  *  Pm 

\  Pm  / 


(27) 
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and  matrix  has  the  column  partitioning  Vm  =  (vi,  v2, vm).  Note  that  complex  arithmetic  is 
introduced  in  the  above  algorithm.  A  way  to  avoid  this  is  described  in  [3]. 

Based  on  Eq.  (26),  we  can  construct  the  reduced-order  model  (see  [l]) 

Fm(t)  =  w{t)  *  xWAKn  exp(~Troi)ei.  (28) 

It  can  be  shown  that  the  number  of  iterations  needed  to  obtain  an  accurate  result  on  the  time 
interval  (0,t]  is  proportional  to  ||A||  t,  where  \\A\\  is  the  2-norm  of  matrix  A  (see  [3j). 


4.  Numerical  results 

We  demonstrate  the  performance  of  the  reduced-order  modeling  technique  by  two  numerical 
examples.  Consider  a  three-dimensional  configuration  in  which  the  external  magnetic-current 
sources  vanish  and  where  the  external  electric-current  sources  are  given  by 

J°(x,  t)  =  w(t)6(x  -  x£TC ),  Jl(x,  t)  =  0,  Jf  (®,  t)  =  0.  (29) 

The  source  wavelet  w{t)  is  chosen  to  be  a  Ricker  wavelet  and  is  given  by 


w(t)  -  x(t) 


(30) 


With  the  parameter  t0  we  can  shift  the  non-zero  part  of  this  wavelet  m  time,  and  with  the 
parameter  9  we  can  vary  its  peak  frequency.  In  all  examples,  we  choose  a  peak  frequency  ol 
40  MHz.  The  spatial  discretization  is  chosen  such  that  we  have  about  31  points  /  A,  where  A  is 
the  free-space  wavelength  corresponding  to  the  peak  frequency  of  40  MHz. 

As  a  first  example,  we  consider  the  electromagnetic  field  in  vacuum  generated  by  the  electric- 
current  source  described  above.  For  this  simple  configuration  the  electromagnetic  field  is  known 
in  closed-form  (see,  for  example,  [4]).  In  our  finite-difference  approximation  the  delta  function  is 
approximated  by  a  three-dimensional  triangular  distribution.  The  ana  ytical  solution  is  therefore 
weighted  over  this  distribution.  The  resulting  El  component  of  the  electric  field  strength  at  an 
observation  point  located  5  m  from  the  source  is  shown  in  Figure  1  (solid  line).  Reduced-order 
models  for  the  El  component  at  the  same  observation  point  are  also  shown  in  this  figure  (dashed 
lines).  The  dashed  line  in  Figure  la  shows  the  reduced-order  model  obtained  after  100  iterations, 
in  Figure  lb  after  200  iterations,  and  in  Figure  lc  after  300  iterations.  We  observe  that  the 
interval  in  which  the  reduced-order  model  is  correct  increases  with  the  number  of  iterations. 

As  a  second  example,  consider  the  configuration  of  Figure  2.  A  cube  is  located  1  m  below  an 
air-earth  interface.  The  source  and  the  receiver  are  symmetrically  located  above  this  cube,  5  m 
apart.  The  solid  line  in  Figure  3  shows  the  converged  reduced-order  model  for  the  component 
of  the  electric  field  strength.  The  dashed  line  in  Figure  3a  shows  the  reduced-order  model  obtained 
after  100  iterations,  in  Figure  3b  after  200  iterations,  and  in  Figure  3c  after  300  iterations.  From 
these  results  we  infer  that  the  reduced-order  models  converge  in  a  similar  manner  as  in  the  previous 
example. 


Appendix 

We  prove  the  reciprocity  relations  of  Eqs.  (22)  and  (23).  From  Eq.  (24)  it  follows  that 

(AT)kM26-  =  M25~  Ak  for  k  =  0, 1, ... .  (31) 

Multiply  this  equation  on  the  left  and  on  the  right  by  5EM2_1.  We  obtain 

6EM21(AT)k6E  =  6EAkM21SE  for  k  =  0, 1, ...,  (32) 
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FIGURE  1.  Electric  field  strength  E\  [V/m]  as  a  function  of  time  [ns].  The  solid  line  signifies  the  exact 
result.  The  dashed  line  is  the  reduced-order  model  after  (a)  100  iterations,  (b)  200  iterations,  and  (c) 
300  iterations. 


since  5E5~  =  6~SE  =  8E.  If  we  multiply  Eq.  (31)  on  the  left  by  8BM2  1  and  on  the  right  by  5KM2  \ 
we  get 


FMZl(AT)kP  =  ~5EAkM218H  for  k  =  0, 1, ...,  (33) 

since  £H<5"  =  6~5H  =  —  <5H.  Now,  since  uT  exp(— AtfM^Q  is  a  scalar,  we  can  write 
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FIGURE  3.  Electric  field  strength  E\  [V/mj  as  a  function  of  time  [ns].  The  solid  line  shows  the  con¬ 
verged  reduced-order  model.  The  dashed  line  is  the  reduced-order  model  after  (a)  100  iterations,  (b) 
200  iterations,  and  (c)  300  iterations. 
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Abstract.  In  this  paper,  a  model  order  reduction  approach  based  on  Krylov  subspace 
iteration  is  used  in  conjunction  with  the  Finite  Difference  method  on  unstructured  grids 
for  the  rapid  electromagnetic  characterization  of  electromagnetic  resonant  structures.  The 
proposed  model  order  reduction  algorithm  is  suitable  for  the  non-symmetric  matrices  re¬ 
sulting  from  the  finite  difference  approximation  of  the  hyperbolic  system  of  Maxwell’s 
equations  on  unstructured,  non-orthogonal  grids.  Numerical  examples  are  used  to  validate 
the  proposed  algorithm  and  demonstrate  its  attributes. 


1.  Introduction 

Over  the  past  few  years  model  order  reduction  techniques  have  been  used  to  expedite 
the  numerical  simulation  of  electromagnetic  systems  [1]-[10J.  The  basic  attribute  of  model 
order  reduction  methods  is  the  generation  of  system  response  over  a  broad  frequency  range 
at  the  computational  cost  of  the  inversion  (factorization)  of  the  discrete  system  matrix  only 
at  one  or  a  few  frequencies.  Such  a  fast  frequency  sweep  is  realized  by  generating  a  Pade 
approximation  of  the  response  of  as  high  an  order  as  necessary  for  acceptable  accuracy 
over  the  desired  bandwidth.  Accurate,  continuous  Pade  approximations  of  the  response 
of  electromagnetic  systems  are  extremely  useful  for  the  design  of  high-Q  electromagnetic 
components  such  as  filters,  impedance  transformers,  cavities,  etc.  The  accurate  prediction 
through  simulation  of  the  resonant  behavior  of  such  electromagnetic  components  often  re¬ 
quires  significant  computational  effort.  For  example,  tens  of  thousands  of  time  steps  may  be 
needed  for  the  Finite-Difference  Time-Domain  (FDTD)  method  to  capture  the  resonance 
of  a  high-Q  system.  Also,  it  is  extremely  time-consuming  for  traditional  frequency-domain 
finite  methods  (finite  difference,  finite  element)  to  obtain  broadband  responses  for  complex 
electromagnetic  systems  with  discrete  approximations  involving  tens  or  even  hundreds  of 
thousands  of  unknowns. 

Preliminary  efforts  in  the  application  of  model  order  reduction  practices  focused  on 
the  Asymptotic  Waveform  Evaluation  (AWE)  method  and  its  derivatives.  However,  the 
numerical  instability  of  the  Pade  approximation  generation  process  in  the  AWE  process 
restricted  its  use  to  approximations  of  lower  order.  It  is  well  known  that  continuous 
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electromagnetic  systems  are  of  infinite  order.  Thus,  the  number  of  the  poles  required  for 
the  exact  presentation  of  their  response  is  also  infinite.  Once  a  discrete  method  is  used  to 
approximate  a  continuous  electromagnetic  system  the  generated  discrete  electromagnetic 
system  is  of  finite,  yet  very  high  order.  More  specifically,  the  order  of  the  discrete  system 
equals  the  number  of  state  variables  (unknowns)  in  the  approximation.  The  objective  of 
model  order  reduction  is  to  generate  a  Pade  approximation  of  the  response  of  the  discrete 
electromagnetic  system  maintaining  only  a  limited  number  of  poles  which  is  much  smaller 
than  the  order  of  the  discrete  system,  yet  high  enough  to  model  the  response  accurately 
over  a  broad  frequency  range. 

With  the  success  of  Krylov  subspace  algorithms  for  model  order  reduction  [11]- [14], 
the  issue  of  stability  of  the  generation  of  Pade  approximations  of  high  order  has  been  ad¬ 
dressed  effectively.  However,  with  the  exception  of  the  work  by  Remis  and  van  de  Berg 
[5]  and  Zunoubi  et  al  [10],  most  model  order  reduction  schemes  based  on  Krylov  subspace 
iteration  effect  Pade  approximations  by  indirectly  matching  moments  at  a  finite  expan¬ 
sion  frequency  point.  Even  though  there  are  many  good  reasons  for  doing  so  (e.g.,  finite 
frequency  point  expansions  lead  to  accurate  Pade  approximations  using  only  a  few  tens  of 
poles),  these  approaches  require  the  factorization  of  a  sparse,  yet  very  large  matrix.  This 
is  highly  undesirable  when  problems  involving  hundreds  of  thousands  of  unknowns  need 
be  modeled.  Furthermore,  the  option  of  using  an  iterative  process  for  the  factorization 
becomes  questionable  when  Pade  approximations  of  fairly  high  order  are  needed.  In  view 
of  the  above  concerns,  we  return  to  the  use  of  the  point  at  infinity  as  the  expansion  point 
for  the  generation  of  Pade  approximations.  Contrary  to  the  aforementioned  implementa¬ 
tions  of  such  an  approach  for  the  case  of  skew-symmetric  [5],  [9]  or  symmetric  [10]  matrix 
approximations  to  the  electromagnetic  equations,  we  examine  the  case  of  discrete  approxi¬ 
mations  involving  non-symmetric  matrices.  Such  cases  arise  when  finite  difference  or  finite 
volume  methods  are  used  to  approximate  Maxwell’s  equations  on  unstructured  grids.  The 
paper  begins  with  the  mathematical  description  of  the  model  order  reduction  methodol¬ 
ogy.  Numerical  examples  from  its  application  to  the  characterization  of  electromagnetic 
resonant  structures  are  used  to  validate  the  method  and  demonstrate  its  attributes. 

2.  Semi-Discrete  FORM  of  Maxwell’s  Equations 

For  the  purposes  of  model  order  reduction,  it  is  assumed  that  the  media  are  linear, 
isotropic  passive  and  time-independent.  Thus,  Maxwell’s  curl  equations  in  the  Laplace 
domain  assume  the  form 

Vxl  =  —sfiH  -  Ms  (la) 

V  x  H  =  seE  +  aE  +  Js  (16) 

where  the  permittivity  e,  conductivity  a  and  permeability  ft  are  assumed  to  be  position- 
dependent.  The  semi-discrete  form  of  Maxwell’s  equations  (1)  is  obtained  through  the 
discretization  of  the  curl  operators  by  the  unstructured  finite  difference  method  on  an 
unstructured  version  of  Yee’s  lattice 
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In  the  above,  the  sparse  matrices  Ce,  denote  the  discrete  forms  of  the  curl  operators, 
while  the  matrices  Da,  Bh  and  De  are  diagonal  matrices  that  represent  the  discrete  forms 
of  the  electric  conductivity,  magnetic  permeability,  and  electric  permittivity  properties  of 
the  media,  respectively.  The  vectors  E,  H  are  the  discrete  forms  of  the  unknown  electric 
field  and  magnetic  field  components,  while  the  vectors  Ms,  Js  represent  the  discrete  forms 
of  spatial  distributions  of  the  electric  and  magnetic  current  sources,  respectively.  Because 
of  the  assumption  of  passivity  of  the  media,  the  matrices  D^,  De  are  (symmetric)  positive 
definite,  and  DCT  is  (symmetric)  non-negative  definite.  The  resulting  compact  form  of  (2) 

(G  +  sC)X  =  -BU(s)  (3) 

where  the  vector  U  denotes  the  Laplace  transforms  of  the  current  source  waveforms,  and 
the  spatial  dependence  of  the  sources  is  described  by  the  matrix  B.  With  the  frequency 
shift  s  =  so  +  s',  (3)  may  be  cast  in  the  form 

(I  —  s'A)X  =  RU(s')  (4) 


A  =  -(G  +  soC)”1,  R  =  -(G  +  soC)-1B 


Defining  a  desired  output  vector  as 


Y(S)  =  FX(s) 


a  transfer-function  matrix  can  be  defined  for  the  system 

H(s')=F(I-s'A)-1R  (7) 

The  objective  of  model  order  reduction  techniques  is  to  generate  rational  function  approx¬ 
imations  (Pade  approximations)  to  the  elements  of  H(s')  of  order  q  much  smaller  than 
the  number  of  state  variables  N.  The  most  robust  methodologies  for  the  generation  of 
high-order  Pade  approximations  of  dynamic  systems  are  those  based  on  Krylov  subspace 
iteration  using  the  Lanczos  and  Arnoldi  algorithms. 


3.  The  Process  of  Model  Order  Reduction 

In  order  to  illustrate  how  Krylov  subspace  iteration  methods  can  be  sued  for  model 
order  reduction  purposes,  consider  the  change  of  variable 

X  =  VX  (8) 

where  X  is  the  reduced  state  vector.  Its  length  q  is  assumed  to  be  much  smaller  than  the 
length  N  of  the  original  vector  X.  The  transformation  matrix  V  is  of  dimension  Nxq  and 
is  assumed  to  be  orthonormal,  i.e.  VTV  =  Iq,  where  Iq  is  the  identity  matrix  of  dimension 
q.  Substitution  of  (8)  into  (3)  and  (6),  Mowed  by  multiplication  on  the  left  of  (3)  by  V 

'  [(VTGV)  +  <s(VTCV)]X  =  -(VtB)U(S)  (9) 
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(10) 


Y  =  (FV)X 

Thus,  a  reduced-order  system  may  be  defined  in  terms  of  the  matrices 

G  =  VrGV,  C=VTCV,  B  =  VtB,  F  =  FV  (11) 

Because  of  the  low  order,  the  eigendecomposition  of  this  system  is  very  inexpensive,  and 
provides  for  a  fast  calculation  of  the  system  response  at  any  desirable  frequency.  From  the 
above  it  is  clear  that  the  generation  of  the  congruence  transformation  matrix  V  is  the  key 
to  the  successful  implementation  of  such  a  model  order  reduction  methodology.  This  can 
be  achieved  using,  for  example,  the  block  Arnoldi  algorithm  to  generate  a  Krylov  subspace 
using  the  matrices  A  and  R  in  (5)  as  described  in  [11].  From  the  point  of  view  of  CPU  and 
memory  requirements,  the  factorization  of  the  matrix  (G  +  soC),  and  the  storing  of  the 
transformation  matrix  V  are  undesirable.  Three-dimensional  electromagnetic  analysis  of 
realistic  problems  using  finite  methods  may  lead  to  a  semi-discrete  system  with  hundreds  of 
thousands,  even  millions,  of  unknowns.  Thus,  despite  its  sparsity,  the  factorization  of  (G+ 
s0C)  may  become  prohibitive.  A  possible  way  of  avoiding  the  factorization  of  the  matrix 
is  to  use  the  Lanczos  and  Arnoldi  algorithms  for  the  generation  of  Pade  approximations 
with  expansion  point  at  infinity  (so  — *  oo).  It  will  be  shown  that  the  implementation  of 
such  an  algorithm  is  fairly  straightforward  when  used  in  conjunction  with  FD-TD- based 
and  Finite- Volume  Time-Domain  (FV-TD)-based  field  solvers. 

The  development  of  the  algorithm  begins  by  casting  the  state  system  of  (3),  (6)  in  the 

form 

(si  -  A)X(s)  =  RU(s)  (12) 

Y(s)  =  FX(s)  (13) 

where 

A  =  -C~1G,  R  =  C-1B  (14) 

It  is  noted  that  that  the  inversion  of  C  is  trivial,  since  it  is  a  diagonal  matrix  in  the 
semi-discrete  electromagnetic  systems.  Thus,  the  block  Arnoldi  algorithm  may  be  cheaply 
used  to  generate  the  Krylov  subspace  needed  for  the  passive  reduction  of  the  system. 
Furthermore,  it  should  be  apparent  that  the  matrix- vector  multiplication  in  each  iteration 
of  the  block  Arnoldi  algorithm  is  computationally  equivalent  to  one  update  for  the  electric 
and  magnetic  fields  everywhere  in  the  finite  grids. 

In  the  case  of  the  memory  shortage  associated  with  the  storage  of  the  congruence 
transformation  matrix  V,  an  alternative  of  the  block  Arnoldi  algorithm,  the  block  Lanczos 
algorithm  [12],  can  be  implemented  for  the  generation  of  the  reduced-order  macromodel 
of  electromagnetic  systems.  The  block  Lanczos  algorithm  reduces  the  system  matrix  A  in 
(4)  to  a  q  x  q  block  tridiagonal  matrix,  Tg,  such  that 

WrAV  =  DT„  WTV  =  D  (15) 

where  the  N  x  q  matrix  V  forms  a  basis  for  the  Krylov  subspace  Kr( A,  R),  and  the  N  x  q 
matrix  W  forms  a  basis  for  the  Krylov  subspace  Kr(AT,  R).  The  two  matrices  V  and  W 
are  generated  to  be  bi-orthogonal.  Hence,  the  matrix  D  is  a  diagonal  matrix.  It  should  be 
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emphasized  that  the  matrix  A  in  (4)  may  be  an  arbitrary  non-symmetric  matrix.  In  such 
a  case  it  is  known  that  the  block  Lanczos  algorithm  may  break  down  due  to  numerical 
round-off  error.  One  of  the  remedies  for  such  breakdown  is  to  use  the  looked-ahead  Lanczos 
algorithm.  Alternatively,  a  modified  Lanczos  algorithm  may  be  used  for  the  generation  of 
the  Krylov  subspace  for  the  case  of  non-symmetric  matrices  such  that 

WTV  =  Iq  (16) 


The  details  of  the  algorithm  have  been  given  in  [13]  and  will  not  be  repeated  here.  The 
model  order  reduction  affected  by  the  Lanczos  algorithm  may  be  illustrated  by  the  change 


of  variables 

x  =  vx 


(17) 


Substituting  (17)  into  (12)  and  multiplying  on  the  left  of  (12)  by  WT  yield 


(si  -  TV)X  =  WtRU(s) 


(18) 


Finally,  the  response  of  the  reduced-order  system  is  given  by 

Y(s)  =  FV(sI  -  T9)"1WtRU(s)  (19) 


As  already  mentioned,  the  matrix-vector  multiplications  involving  the  matrices  A  and  AT 
are  the  curl  operations  based  on  the  discretization  of  fields.  More  specifically,  the  matrix 
multiplication  associated  with  the  matrix  A  is  identical  to  the  curl  operation  done  by  the 
FD-TD  or  the  FV-TD  algorithm  for  one  update  of  the  fields.  On  the  other  hand,  the 
multiplication  involving  the  matrix  AT  is  a  slightly  different  operation,  which  nevertheless 
can  be  affected  also  as  a  discrete  curl  operation.  Thus,  each  iteration  step  of  the  block 
Lanczos  algorithm  is  computationally  equivalent  to  two  FD-TD  or  FV-TD  field  updates. 

Next,  it  is  important  to  recognize  the  fact  that  the  matrices  V  and  W  do  not  have 
to  be  stored,  even  though  they  seem  to  be  needed  for  the  generation  of  the  reduced-order 
model  in  view  of  (19).  As  explained  in  [12],  the  matrices  V  and  W  are  constructed  in  such 
a  manner  that  it  is 


R  =  V 


ai 
0  ’ 


F  =  [a|-  0]WT 


(20) 


where  the  matrices  a,  and  a2  are  upper  triangular  matrices.  Using  the  bi-orthogonabty  of 
V  and  W,  the  response  of  the  system  may  be  cast  in  the  form 


Y(s)  =  [af 


OKsI-T,)-1 


ai 

0 


U(s) 


(21) 


Clearly,  all  matrices  in  this  final  expression  of  the  response  are  of  dimension  qxq.  These 
are  the  only  arrays  needed  to  be  stored  in  the  block  Lanczos  algorithm  for  model  order 
reduction  purposes.  It  should  be  mentioned  that  the  Taylor  expansions  of  the  impulse 
response  of  the  semi-discrete  system  of  (12)  around  t  =  0  agree  with  the  reduced  system 
of  (21)  up  to  the  first  q  terms,  where  q  is  the  order  of  the  reduced  system.  The  immediate 
consequence  of  this  result  is  that  the  reduced  system  is  expected  to  provide  very  good 
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accuracy  for  the  early  period  of  the  transient  response.  An  estimate  of  the  time  interval 
has  been  developed  in  [5]  in  terms  of  the  order  of  the  reduced  model  and  a  properly  defined 
norm  of  the  matrix  A.  According  to  this  estimate,  the  higher  the  order  of  the  reduced 
model,  the  longer  the  time  interval  over  which  the  transient  response  can  be  captured 
by  the  reduced  model.  The  rapid  calculation  of  the  electromagnetic  response  of  resonant 
systems  can  be  affected  by  directly  extracting  the  response  of  the  reduced-order  model 
in  the  Laplace  domain,  instead  of  seeking  the  eigenvalues  of  the  tridiagonal  matrix  Tg  of 
the  reduced-order  system,  if  the  excitation  sources  are  assumed  to  be  impulse  functions 
in  time.  Clearly,  the  eigenvalues  of  the  tridiagonal  matrix  Tg  are  only  a  subset  of  the 
eigenvalues  of  the  original  matrix  A.  In  particular,  for  the  case  where  the  expansion  point 
is  the  point  at  infinity,  the  generated  Krylov  subspaces  are  spanned  by  the  eigenvectors 
corresponding  to  the  highest  eigenvalues.  Consequently,  Pade  approximations  of  very 
high  order  may  be  necessary  if  some  of  the  lower  eigenvalues  of  the  sytem  are  needed.  This 
is  one  of  the  major  drawbacks  of  Pade  approximations  generated  using  the  point  at  infinity 
as  the  expansion  point. 

4.  Numerical  Validation 

The  following  examples  are  aimed  at  demonstrating  the  application  of  the  proposed 
methodology  to  the  rapid  electromagnetic  analysis  of  resonators.  From  the  description  of 
the  methodology  it  should  be  clear  that  the  eigen-analysis  is  affected  through  the  impulsive 
excitation  of  the  system  and  the  subsequent  generation  of  the  reduced  representation  of  its 
response  using  as  basis  eigenvectors  in  the  generated  Krylov  subspace.  We  consider  first  a 
simple  cylindrical  resonant  cavity  for  which  closed-form  solutions  are  available.  The  radius 
of  the  cavity  is  lm.  The  length  of  the  cavity  is  equal  to  its  radius.  The  semi-discrete  form  of 
Maxwell’s  equations  (1)  is  obtained  by  the  non-orthogonal  discretization  of  the  structure. 
The  resulting  system  matrix  is  non-symmetric.  Table  I  compares  the  resonant  frequencies 
of  the  reduced  order  model,  generated  by  the  Lanczos  algorithm  with  expansion  frequency 
point  at  infinity,  with  the  analytically  obtained  results.  Also  shown  are  the  resonant 
frequencies  obtained  using  the  FD-TD  algorithm  on  the  non-orthogonal  grid.  Very  good 
agreement  is  observed  for  the  lowest  four  modes.  The  order  of  the  reduced  model  is  250. 
As  already  explained  in  the  previous  section,  the  computational  cost  associated  with  the 
generation  of  the  reduced  model  of  order  250  is  equivalent  to  500  FD-TD  updates.  To 
achieve  the  resolution  of  A /  =  1.5 MHz  for  this  problem,  the  non-orthogonal  FD-TD 
algorithm  requires  4096  time  steps. 

Table  I.  Resonant  frequencies  for  an  empty  cylindrical  cavity  {MHz). 


Mode 

Analytic 

FDTD 

MORe 

TMoio 

114.7 

113.6 

113.8 

TEni 

173.8 

173.3 

173.5 

TMn  o 

183.0 

181.8 

181.0 

TMon 

188.9 

188.9 

188.0 

The  second  example  concerns  the  eigenfrequency  analysis  of  a  cylindrical  cavity  loaded 
by  a  dielectric  rod  (er  =  37.6).  The  radius  and  length  of  the  cavity  are  0.5in  and  0.55m, 
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respectively.  The  radius  of  the  dielectric  rod  is  0.394m.  Table  II  compares  the  resonant 
frequencies  produced  by  a  reduced  model  of  order  1000  with  those  obtained  through  a 
mode-matching  solution,  and  a  non-orthogonal  FD-TD  analysis  of  the  problem.  Once 
again,  very  good  agreement  is  observed.  Since  the  order  of  the  reduced  model  is  1000, 
the  computational  cost  of  its  generation  is  equivalent  to  2000  FD-TD  iterations.  The  non- 
orthogonal  FD-TD  algorithm  requires  about  16, 800  updates  to  achieve  the  resolution  of 
A  /  =  0.05  GHz. 

Table  II.  Resonant  frequencies  for  a  cylindrical  cavity  loaded  by  the  dielectric  rod  {GHz). 


Mode 

Mode  —  matching 

FDTD 

MORe 

_ 

1.50 

1.47 

1.49 

_ 

2.44 

2.41 

2.42 

HEn 

2.50 

2.50 

2.51 

— 

3.07 

3.06 

TMon 

3.38 

3.31 

3.32 

he2U 

3.38 

3.48 

3.47 

5.  Conclusions 

Model  order  reduction  methodologies  based  on  the  Amoldi  and  the  Lanczos  algo¬ 
rithms  have  been  discussed  in  the  paper.  In  order  to  overcome  the  major  obstacle  of  the 
CPU  and  memory  overhead  associated  with  the  use  of  these  algorithms  with  finite  fre¬ 
quency  expansion  points,  the  Lanczos  algorithm  with  expansion  point  the  point  at  infinity 
is  an  attractive  alternative  when  discrete  electromagnetic  systems  involving  hundreds  of 
thousands  of  unknowns  need  to  be  modeled. 

For  the  three-dimensional  electromagnetic  resonators  analyzed  in  this  paper,  the  dis¬ 
crete  electromagnetic  model  was  generated  using  finite  differences  on  an  unstructured  Yee’s 
lattice.  Such  a  discretization  leads  to  a  non-symmetric  system  matrix  approximation  of  the 
coupled  system  of  Maxwell’s  curl  equations.  Despite  the  potential  of  numerical  breakdown 
of  the  Lanczos  algorithm  for  such  cases,  no  problems  were  encountered  for  the  examples 
considered  in  this  paper.  In  any  case,  remedies  exist  for  avoiding  such  breakdowns;  hence, 
this  breakdown  concern  is  by  no  means  a  prohibitive  factor  in  the  application  of  the  Lanczos 
algorithm  with  non-symmetric  discrete  approximations  of  electromagnetic  systems. 
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Abstract 

This  paper  presents  some  recent  advances  in  automated  electromagnetic  (EM)  optimization  of  microwave  components  and 
circuits.  We  review  the  challenge  of  EM  optimization  and  present  illustrative  examples,  involving  both  direct  optimization 
using  commercial  EM  simulators  and  space  mapping  optimization.  We  review  a  robust  new  algorithm  for  EM  optimization 
called  TRASM  (Trust  Region  Aggressive  Space  Mapping),  which  integrates  a  trust  region  methodology  with  the  aggressive 
space  mapping  (ASM)  technique.  The  trust  region  ensures  that  each  iteration  results  in  improved  alignment  between  the 
coarse  and  fine  models  and  recursive  multi-point  parameter  extraction  improves  the  conditioning  of  the  extraction  phase. 
We  exploit  Sonnet’s  em,  MATLAB,  OSA90,  Datapipe,  interpolation  and  database  techniques  and  Geometry  Capture. 
Examples  include  the  EM  optimization  of  microstrip  T-junctions  and  an  HTS  filter. 


INTRODUCTION 

Future  CAE  systems  for  high-speed,  wireless  and  microwave  circuits  and  systems  will  not  be  regarded  as  complete  without 
available  powerful  EM  simulators  integrated  with  optimization  capabilities.  Optimal  design  of  active  and  passive  devices, 
circuits  and  systems  is  expected  to  be  physically  and  electromagnetically  based,  to  include  electrical,  mechanical  and  thermal 
effects.  Future  developments  in  integrated  CAE  tools  must  concurrently  link  geometry,  layout,  physical,  EM  and  process 
simulations,  with  performance,  yield,  cost,  system  specifications,  manufacturability  and  testability  in  a  manner  transparent  to 
the  designer  [1]. 

The  sienificant  features  of  EM  simulators  over  empirical  model/circuit-theory  based  simulators  is  their  unsurpassed  accuracy, 
extended  validity  ranges,  and  their  capability  of  handling  more  arbitrary  geometrical  shapes.  Increasingly  more  complex 
structures  are  being  simulated  accurately  and  quickly  as  computational  techniques  improve,  making  optimization  more  practical 
and  attractive  with  tools  such  as  HP  HFSS  [2].  As  was  discussed  in  the  1997  ACES  Conference  13]  EM  simulators  will  not 
realize  their  full  potential  to  the  designer  unless  they  are  optimizer-driven  to  automatically  adjust  designate  parameters  [4,5,6], 


J.W.  Bandler  is  also  with  Bandler  Corporation,  P.O.  Box  8083,  Dundas,  Ontario,  Canada  L9H  5E7. 

D.G.  Swanson,  Jr.,  is  with  the  M/A-COM  Division  of  AMP,  Inc.,  Central  R&D  Group,  M/S  L3, 100  Chelmsford  Street,  Lowell,  MA  01853-3294. 

This  work  was  supported  in  part  by  the  Natural  Sciences  and  Engineering  Research  Council  of  Canada  (NSERC)  under  Grants  OGP0007239,  STP0201 832, 
TRIO,  Com  Dev,  Nanowave  Technologies  and  through  the  Micronet  Network  of  Centres  of  Excellence.  M.H.  Bakr  was  supported  under  an  Ontario  Graduate 
Scholarship  and  N.  Georgieva  by  a  Postdoctorate  Fellowship  from  NSERC. 


326 


This  paper  presents  some  recent  advances  in  automated  EM  optimization  of  microwave  components  and  circuits.  We  review 
the  challenge  of  EM  optimization  and  present  illustrative  examples,  involving  both  direct  optimization  using  commercial  EM 
simulators  and  space  mapping  (SM)  optimization  [7,8].  SM  optimization  is  not  yet  available  in  user-friendly  form  in  a 
commercial  package.  The  powerful  SM  concept  uses  a  “coarse”  model  to  carry  out  the  bulk  of  computations  in  the 
optimization  process.  The  coarse  model  can  be  an  empirical  model,  an  equivalent  circuit  model  or  an  EM  model  with  a  coarse 
resolution.  A  “fine”  or  accurate  EM  model  is  used  to  align  the  coarse  model  and  guide  the  optimization  process. 

We  review  what  is  claimed  to  be  a  robust  new  algorithm  for  EM  optimization  called  Trust  Region  Aggressive  Space 
Mapping  (TRASM)  [9],  which  integrates  a  trust  region  methodology  with  the  aggressive  space  mapping  (ASM)  technique 
[8].  The  trust  region  is  to  ensure  that  each  iteration  results  in  improved  alignment  between  the  coarse  and  fine  models 
needed  to  execute  ASM.  A  recursive  multi-point  parameter  extraction  process  is  proposed  to  improve  the  conditioning  of 
the  extraction  phase. 

We  exploit  Sonnet’s  ern  [10],  MATLAB  [11],  OSA90  [12]  and  Empipe  [12],  In  addition  we  implement  OSA’s  Datapipe 
technology,  interpolation  and  database  techniques,  as  well  as  Geometry  Capture  [12].  Examples  include  the  direct  EM 
optimization  of  microstrip  T-junctions  and  the  space  mapping  optimization  of  an  HTS  filter. 


DIRECT  OPTIMIZATION:  DESIGN  OF  OPTIMAL  MICROSTRIP  T-JUNCTIONS 

“Direct”  optimization  has  been  reported  by  Bandler  et  al.  [4,5,13].  We  describe  here  some  new  results.  We  compare  four 
different  configurations  to  compensate  discontinuities  in  a  microstrip  T-junction  by  optimization  over  a  broad  frequency 
range.  Three  of  them  are  taken  from  the  literature,  one  is  introduced  here.  The  T-junctions  are  optimized  to  achieve  the 
minimum  possible  mismatch  at  the  three  ports. 

The  T-junction  configuration  is  symmetric  and  is  connected  to  three  50  Q  microstrip  lines.  The  first  configuration  shown  in 
Fig.  1(a)  was  suggested  by  Chadha  and  Gupta  [14].  The  discontinuity  is  compensated  by  removing  a  triangle  portion  from 
the  basic  T-junction.  The  configuration  of  Fig.  1(b)  was  introduced  in  [15]  but  no  optimization  was  performed.  Here,  the 
compensation  is  done  by  modifying  the  common  arm  and  the  two  comers.  The  T-junction  in  Fig.  1(c)  is  introduced  here:  it 
is  similar  to  that  in  Fig.  1(b).  The  one  in  Fig.  1(d)  is  suggested  by  Dydyk  [16]  but  no  optimization  was  applied.  The  T- 
junctions  of  Figs.  1(a)  and  1(b)  were  compared  by  Swanson  [17]  without  optimization.  The  optimization  variable  for  Fig. 
1(a)  is  the  side  length  r  at  tfequal  to  30°,  45°  and  60°.  For  Figs.  1(b),  1(c)  and  1(d)  the  optimization  variables  are  x  and  y. 
These  variables  are  adjusted  to  satisfy  the  theoretical  simultaneous  match  condition  which  is  9.54  dB  (20  log  (1/3))  return 
loss  at  the  three  ports.  The  specifications  are  |  Sn  |  <  1/3,  |  S22 1  <  1/3  in  the  frequency  range  2  GHz  to  16  GHz.  The  width 
w,  the  height  h  and  the  relative  dielectric  constant  &  are  fixed  during  optimization.  We  use  Sonnet’s  em  [10]  to  compute 
the  S  parameters  of  the  different  T-junction  configurations,  the  minimax  optimizer  in  OSA90/hope  [12]  to  perform 
optimization  and  Empipe  [12]  to  parameterize  the  geometry  of  the  T-junctions. 

Optimization  Results 

The  side  length  r  of  the  triangle  in  Fig.  1(a)  is  optimized  for  $  equal  to  30°,  45°  and  60°.  Table  I  shows  the  results.  The  T- 
junctions  in  Figs.  1(b),  1(c)  and  1(d)  are  optimized  here  with  respect  to  x  and  y,  each  from  five  starting  points.  Unique 
solutions  are  reached  (Table  II).  Fig.  2  compares  responses  in  the  frequency  range  2  GHz  to  20  GHz  for  the  optimized  T- 
junctions  in  Fig.  1.  It  is  clear  that  the  T-junction  in  Fig.  1(a)  with  9  equal  to  30°  gives  the  worst  results.  The  other  T- 
junctions  give  satisfactory  results  with  minor  differences.  Fig.  3  compares  responses  for  the  optimized  T-junctions  of  Figs. 
1(b),  1(c)  and  1(d).  The  T-junction  of  Fig.  1(d)  gives  the  best  response.  The  response  of  the  T-junction  of  Fig.  1(c) 
introduced  here  is  slightly  better  than  the  T-junction  of  Fig.  1(b). 
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Fig.  1.  Four  different  T-junctions  with  w=  24  mil,  h=25  mil  and  &=  9.9. 
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Fig.  2.  \SU\  and  j»S22  ]  for  the  basic  T-junction  and  the  optimized  T-junctions  of  Fig.  1 . 


Nf 


Fig.  3.  \SU\  and  |iS22|  for  the  optimized  T-junctions  of  Figs.l(b),  1(c)  and  1(d).  The  range  is  expanded  w.r.t.  Fig.  2. 


TABLE  I 

THE  OPTIMAL  VALUE  OF  r  AT  9  EQUAL  TO  30°,  45°  AND  60° 
FOR  THE  T-JUNCTION  IN  FIG.  1(a) 


e 

The  optimal  value  of  r 

30° 

1.556  w 

45° 

1.355  w 

60° 

1.158  w 

TABLE  II 

THE  OPTIMAL  VALUES  OF  THE  PARAMETERS  x  AND  y  FOR 
THE  T-JUNCTIONS  IN  FIGS.  1(b),  1(c)  AND  1(d) 


T-junction 

Optimal  value  of  x 

Optimal  value  of  y 

_ Fig-  Kb) _ 

0.9250  w 

0.583  w 

Fig.  1(c) 

0.7271  w 

0.7917  w 

_ Fig-  Kd) _ 

0.1  w 

0.9167  w 

OPTIMIZATION  THROUGH  SPACE  MAPPING 

Space  mapping  (SM)  aims  at  aligning  two  different  simulation  models:  the  so-called  “coarse”  model,  typically  an  empirical 
circuit  simulation  and  a  “fine”  model,  typically  a  foil  wave  EM  simulation  [1,  7-9].  The  concept  is  illustrated  in  Fig.  4, 
where  the  coarse  model  is  called  the  “optimization”  model  and  the  fine  model  is  identified  as  the  “validation”  model. 

The  technique  combines  the  accuracy  of  the  fine  (validation)  model  with  the  speed  of  the  coarse  (optimization)  model 
during  the  circuit  optimization  process.  Parameter  extraction  is  a  crucial  part  of  the  technique.  During  this  step  the 
parameters  of  the  coarse  model  whose  response  matches  the  fine  model  response  are  obtained.  The  extracted  parameters 
may  not  be  unique,  causing  the  technique  to  fail  to  converge  to  the  optimal  design. 
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Fig.  4.  Concept  of  aligning  two  spaces  by  the  space  mapping  approach  [8]. 

We  briefly  review  aggressive  space  mapping  (ASM)  EM  optimization.  The  vector  of  parameters  of  the  fine  model  is  referred 
to  as  xem-  The  vector  of  parameters  of  the  coarse  model  is  referred  to  as  Xos.  The  first  step  of  the  ASM  technique  is  to  obtain 
the  optimal  design  of  the  coarse  model  x0s .  The  technique  aims  at  establishing  a  mapping  P  between  the  two  spaces  [8] 

Xos  =  P(xem)  (1) 

such  that 

(2) 

where  Rem  is  the  vector  of  fme  model  responses,  Ros  is  the  vector  of  coarse  mode  responses  and  ||  j|  is  a  suitable  norm. 
The  error  function 

f  =  P(Xem)-Xos  (3) 

is  first  defined.  The  final  fine  model  design  is  obtained  and  the  mapping  is  established  if  a  solution  for  the  system  of 
nonlinear  equations 

AXem)  -  0  (4) 

is  found. 


Let  x'Jl  be  the  rth  iterate  in  the  solution  of  (4).  The  next  iterate  *£*>  is  found  by  a  quasi-Newton  iteration 

*£‘>  =  *2  + A'0  (5) 

where  f£‘^  is  obtained  from 

S)  <« 

and  B{,)  is  an  approximation  to  the  Jacobian  of  the  vector / with  respect  to  xem  at  the  rth  iteration.  The  matrix  B  is  updated 


at  each  iteration  using  Broyden’s  update  [18]. 


It  is  seen  from  (1>(3)  that  the  vector  function /is  obtained  by  evaluating  P(xOT).  This  can  be  achieved  through  the  process 
of  parameter  extraction.  Parameter  extraction  involves  solving  a  subsidiary  optimization  problem. 
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During  parameter  extraction  the  parameters  of  the  coarse  model  whose  response  matches  the  fine  model  response  are  obtained. 
It  can  be  formulated  as 

minimize  -*«(*«)| .  (7) 

Xos 

The  extracted  parameters  may  not  be  unique,  causing  the  space  mapping  optimization  technique  to  fail. 


A  TRUST  REGION  AGGRESSIVE  SPACE  MAPPING 
ALGORITHM  FOR  EM  OPTIMIZATION 

The  TRASM  algorithm  [9]  depicted  in  Fig.  5  integrates  a  trust  region  methodology  [19]  with  ASM.  The  trust  region  is  to 
ensure  that  each  iteration  results  in  improved  alignment  between  the  coarse  and  fine  models  needed  to  execute  ASM.  To 
improve  the  uniqueness  of  the  extraction  phase  Bakr  et  al.  [9]  developed  a  recursive  multi-point  parameter  extraction 
process,  which  exploits  all  available  EM  simulations  already  performed.  TRASM  automates  the  selection  of  fine  model 
points  used  for  the  multi-point  parameter  extraction  process.  Also,  the  current  approximation  to  the  mapping  between  the 
two  spaces  is  integrated  into  the  parameter  extraction  step. 


Fine  Model 
Simulation 


Parameter 

Extraction 


Fig.  5.  The  current  implementation  of  the  TRASM  algorithm  [9]  using  MATLAB  [11]. 


HIGH-TEMPERATURE  SUPERCONDUCTING  FILTER  EXAMPLE 

We  review  optimization  of  a  narrow  band  high-temperature  superconducting  (HTS)  filter  [20]  (Fig.  6).  The  specifications 
are  |  S2i  I  >  0.95  in  the  passband  and  |  S21 1  <  0.05  in  the  stopband,  where  the  stopband  includes  frequencies  below  3.967 
GHz  and  above  4.099  GHz  and  the  passband  lies  in  the  range  [4.008  GHz,  4.058  GHz].  The  optimization  variables  are  L\, 
Lz,  Lz,  S],  Sz  and  S3.  We  take  L0  =  50  mil  and  W=  7  mil.  For  complete  details  see  [9]  and  [20].  The  coarse  model  exploits 
the  empirical  microstrip  line  models,  coupled  lines  and  open  stubs  available  in  OSA90/hope.  The  fine  model  employs  a 
fine-grid  em  simulation.  The  coarse  model  is  first  optimized  using  the  OSA90/hope  minimax  optimizer.  The  corresponding 
fine  model  response  is  shown  in  Fig.  7.  The  final  EM  design  was  obtained  in  5  iterations  which  required  8  fine 
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Fig.  6.  The  structure  of  the  HTS  filter  [20], 


Fig.  8.  The  optimal  coarse  model  response  ( — )  and  the 
optimal  fine  model  response  (o)  for  the  HTS 
filter  [9]. 


frequency  (GHz) 


Fig.  7,  The  optimal  coarse  model  response  ( — )  and  the 

fine  model  response  (o)  at  the  starting  point  for  the 
HTS  filter  [9]. 


3.967  3 .989  4.011  4.033  4.055  4,077  4.099 

frequency  (GHz) 

Fig.  9.  The  optimal  coarse  model  response  ( — )  and  the 

optimal  fine  model  response  (o)  for  the  HTS  filter 
in  the  passband  [9]. 


model  simulations.  The  final  fine  model  response  at  this  design  is  shown  in  Fig.  8.  The  passband  ripples  are  shown  in  Fig. 
9. 
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CONCLUSIONS 

Advances  in  computational  electromagnetics  and  optimization  technology  allow  direct  exploitation  of  EM  simulation 
techniques  in  circuit  design  optimization.  In  this  paper  we  have  reviewed  recent  advances  in  EM  optimization  of  microwave 
components  and  circuits.  We  presented  illustrative  examples,  involving  both  direct  optimization  using  commercial  EM 
simulators  and  space  mapping  optimization.  The  space  mapping  technique  is  a  powerful  approach  to  design  optimization  when 
extremely  CPU  intensive  simulators  are  used,  combining  the  speed  of  circuit-level  optimization  with  the  accuracy  of  EM 
simulations.  We  reviewed  a  new  algorithm  for  EM  optimization  called  TRASM. 
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Examples  include  the  EM  optimization  of  microstrip  T-junctions  and  an  HTS  filter.  In  particular,  we  compared  four 
different  configurations  to  compensate  discontinuities  in  a  T-junction.  A  new  configuration  has  been  introduced  here.  The 
HTS  filter  is  a  particularly  challenging  optimization  problem,  characterized  as  it  is  by  narrow  bandwidth  (about  1.24  %) 
and  high  dielectric  constant  (about  24)  [20]. 
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Abstract  -  In  this  paper,  the  Genetic  Algorithm  (GA)  [1, 2,3,4]  is  employed  as  an  efficient  optimization 
tool  to  derive  SPICE-type  equivalent  circuits  of  microwave  components,  and  of  discontinuities  in  coaxial 
lines,  from  their  specified  S-parameters.  Nodal  analysis  and  techniques  are  utilized  in  conjunction  with  the 
GA  to  extract  these  equivalent  circuits,  whose  component  values  are  the  parameters  optimized  by  the  GA. 
Numerical  examples,  that  demonstrate  the  fact  that  GA  yields  excellent  results  for  the  cases  studied,  are 
included  in  the  paper. 

1.  Extraction  of  Equivalent  Circuits  Using  the  Genetic  Algorithm  (GA) 

The  GA  is  employed  in  this  work  to  derive  the  equivalent  circuit  of  microwave  discontinuities  and 

devices.  This  procedure  considers  each  of  the  component  values  of  the  circuit  as  a  parameter  p,  to  be 
optimized  via  the  GA.  Each  of  the  parameters  is  assigned  a  lower  and  an  upper  limit,  say  pT“  and  p®3* , 
and  is  represented  in  a  discretized  binaiy-coded  form  with  N,bits  called  a  gene.  The  binary  string  of  all 
the  genes  comprising  the  design  is  known  as  a  chromosome. 

The  Objective  function  F  for  the  optimization  procedure  is  chosen  to  be 

r-  ® 

n=l  i=l  >1 

where  N  f  =  Total  number  of  frequencies  of  interest 

S  =  Specified  S-parameters  of  the  equivalent  circuit 

S?A  =  S-parameters  calculated  from  the  equivalent  circuit  selected  by  the  GA 
For  a  given  set  of  specified  S-parameters,  the  GA  strives  to  minimize  the  objective  function  F  by  selecting 
the  component  values  of  the  equivalent  circuit  The  GA  starts  with  an  initial  population  P  0  of  JV-bit 
chromosome  binary  strings.  Each  string  is  a  candidate  for  the  design.  The  GA  then  iteratively  generates  a 

new  population  PM ,  derived  from  the  previous  one  P, ,  through  the  application  of  selection,  crossover, 
and  mutation,  to  evolve  towards  a  better  solution.  In  the  process  of  applying  the  optimization  procedure 
based  on  (1),  we  need  to  compute  the  S~A,  the  S-parameters  of  the  circuit  chosen  by  the  GA,  at  each 
iteration  step. 

2.  Derivation  of  S-Parameters  from  Nodal  Equations 

We  introduce  a  technique  that  employs  nodal  analysis  to  derive  the  S-parameters  of  microwave 
components  and  discontinuities  from  their  Spice-lype  description  of  the  equivalent  circuit.  The  nodal 
equations  of  aN-node  circuit  can  be  expressed  in  the  following  form, 

(I}nzi  =  [F]wxN  {V}h*  <2> 
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where  {1}  and  {V}  are  current  and  voltage  vectors  at  the  N  nodes  for  the  representative  circuit  and  [F]  is 
the  admittance  matrix.  Since  only  input  port  1  and  output  port  2  contain  the  accessible  nodes,  and  are  of 
interest  to  us  in  defining  the  S-parameters  of  the  two-port  network,  we  can  manipulate  (2)  into  the 
convenient  form 
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The  S-parameter  matrix  of  the  circuit  can  now  be  calculated  from 

s-=[/-Tfl][/+y*rJ 

where  S  =  5-parameter  matrix  of  the  circuit 
/  =  Identity  matrix 


(9) 


(10) 
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The  above  procedure  for  deriving  S-parameters  is  used  in  the  GA  optimization  process  to  choose  the 
component  values  of  the  equivalent  circuits. 

3.  Case  Examples 


3.1  Right-angled  Bend  .  .  , 

In  the  first  example  we  consider  a  right-angled  bend  in  a  rmcrostnp  line.  The  geometry  and  the 
equivalent  circuit  of  the  discontinuity,  derived  from  Gupta  [5],  are  shown  in  Fig.  1.  We  begin  with  a  set  of 
S-parameters  that  was  calculated  for  the  right-angled  bend  by  using  a  commercially-available  software 
package  called  Ensemble  [6].  The  component  values  inserted  between  the  nodes  are  the  parameters  to  be 
optimFzed  by  the  GA.  The  GA  views  the  S-parameter  information  as  the  specifications,  and  searches  for 
the  component  values  of  the  equivalent  circuit  that  meets  these  specifications.  Figure  2  shows  the  results 
obtained  from  application  of  the  technique.  The  component  values  derived  from  the  GA  are  nearly 
identical  to  the  ones  given  in  Fig.l.  As  a  consequence,  the  S-parameters  computed  from  the  GA-denved 
circuit  duplicate  the  ones  calculated  by  Ensemble  almost  exactly 

3.2  Flat  Spiral  Inductor  .  ,  . 

Next  we  study  a  flat  spiral  inductor  model  [7],  whose  physical  configuration  and  the  known 
equivalent  circuit  are  given  in  Fig.  3.  The  parameters  Su  and  S2i,  derived  from  component  values  shown 
in  the  first  column  of  Table  1,  were  computed  via  EESOF  [8],  and  were  designated  as  the  specified  S- 
parameters.  The  GA  views  the  specified  S-parameters  as  the  design  objectives.  The  values  of  the 
components  chosen  by  the  GA  are  also  listed  in  Table  1.  Figure  4  shows  a  good  agreement  between  the 
specified  S-parameters  and  the  ones  derived  from  the  GA. 

3.3  Parallel  Plate  Capacitor  .  .  .  .  - 

We  then  consider  the  problem  of  determining  the  component  values  for  the  equivalent  circuit  ot  a 
parallel  plate  capacitor,  whose  physical  configuration  and  equivalent  circuit  (see  [7])  are  shown  in  Fig.  5. 
The  component  values  of  the  circuit  are  listed  in  Table  2.  Once  again  EESOF  was  utilized  to  compute  the 
S-parameters  that  were  viewed  as  the  design  objectives.  The  GA  then  chose  the  component  values  that  are 
also  shown  in  Table  2.  Figure  6  plots  the  magnitude  of  the  specified  S-parameters  and  compares  them 
with  those  computed  from  the  GA-selected  equivalent  circuit. 

3.4  Coaxial  Discontinuity  Problem 

In  this  example,  we  deal  with  the  problem  of  extracting  the  equivalent  circuit  of  a  coaxial 
transmission  line  with  step  discontinuities.  The  transmission  line  shown  in  Fig.  7  consists  of  three  sections 
with  different  dielectric  (*r)  fillings.  The  effect  of  discontinuities  can  be  expressed  in  terms  of  an 
equivalent  T-circuit  (Fig.  7b).  The  component  values  of  the  equivalent  circuit  are  the  parameters  to  be 
optimized  by  the  GA  to  meet  the  S-parameter  specifications.  To  derive  the  S-parameters  for  each  selected 
values,  it  is  convenient  to  employ  an  approach  based  on  the  cascading  of  T-matrices. 

Each  of  the  three  sections  of  the  coaxial  line  can  be  represented  by  a  length  of  transmission  line 
whose  T-matrix  can  be  represented  by  T,  (see  [5]  ).  The  T-matrix  of  the  T-circmt  with  senes  inductors 
and  a  shunt  capacitor  that  represents  the  step  discontinuity,  can  be  expressed  by  Tc  [5].  The  composite  T- 
matrix  of  the  system  can  be  obtained  via  a  simple  cascading  procedure  of  matrix  multiplication,  and  the  S- 
matrix  parameters  can  then  be  derived  for  the  composite  T-matrix. 

To  illustrate  the  above  procedure  we  consider  the  coaxial  configuration.  The  dielectric  constants  sr 
for  the  center  line  is  chosen  to  be  1.5,  and  the  length  of  the  two  air-filled  and  dielectric-filled  linesi ire 
chosen  to  be  0.035m  and  0.1m,  respectively.  The  characteristic  impedance  is  50  ohms  for  each  of  the  three 
lines.  The  series  inductance  is  typically  negligible  for  lower  frequencies,  so  we  simply  use  an  esumated 
capacitance  value  of  0.01  pF  for  the  step  discontinuities,  compute  the  S-parameters  of  the  composite 


336 


structure,  and  feed  these  as  inputs  to  the  GA  program.  Fig.  8  plots  the  specified  and  computed  S- 
parameters,  and  also  shows  the  capacitance  values.  Once  again,  the  GA  has  successfully  selected  the 
component  values  and  has  recovered  the  equivalent  circuit  of  the  coaxial  line  discontinuity.  The  method 
can  be  easily  extended  to  coaxial  lines  containing  three  or  more  sections,  without  any  difficulty. 

4.  Conclusions 

In  this  paper,  we  have  demonstrated  a  new  technique  for  systematically  extracting  SPICE-type 
equivalent  circuits  of  microwave  components,  and  discontinuities  in  coaxial  lines,  from  computed  or 
measured  S-parameters.  The  equivalent  circuit  has  been  derived  by  using  a  GA-based  technique,  which  is 
a  robust  optimization  tool. 
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Table  1.  Comparison  of  original  and  GA-derived  component  values  for  the  planar 
_ inductor  shown  in  Fig.4. _ _ _ 


Component  values  of  the  equivalent  circuit 

Component  values  chosen  by  the  GA 

L  =  1.9790948  nH 

L  -  1.953125  nH 

C  =  0.7146493  pF 

C  =  0.703125  pF 

R  =  0.237845  Q 

R  =  0.1367188  Q 

L  *  =  2.190083  nH 

L  k  =  2.402344  nH 

Cp  =  0.265736  pF 

C  p  =  0.2148438  pF 

Table  2.  Comparison  of  original  and  GA-derived  component  values  for  the  parallel 
_ plate  capacitor  shown  in  Fig.6.  _ ■  _ 


Component  values  of  the  equivalent  circuit 

Component  values  chosen  by  the  GA 

Cgl  =  Cg2= 0.2798  pF 

Cgl  =  Cg2  =  0.3770  pF 

L  =  0.3847  nH 

L  =  0.6387  nH 

Rp  =0.0189  Q. 

Rp  =0.0195  Q 

Rd  =  29288. 8986  Q 

Rd  =  29284.05  Q 

Cm  -  1 0.8679  pF 

Cm=  10.8398  pF 
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T,  «  r~v~v^ — _j_ — - •  T2 

_ -p  c 


(b) 

Specified  GA 

L  «  0.1074  ntf  L  =  0.1074219  nH 

C-  0.2047  />F  C  =  0.2041016/;F 

Fig.  1 .  Right-angled  bend  in  microstrip  line  and  its  equivalent  circuit 


Fig.2.  Magnitude  of  the  specified  SI  1  and  S22  and  those  derived  from  the  GA  for  a  right-angled  bend. 
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Fig.5.  Physical  configuration  of  a  parallel  plate  capacitor  and  its  equivalent  circuit. 


(a)  (b) 

Fig.6.  Comparison  of  the  specified  S  „  and  S  2,  of  the  parallel  plate  capacitor 
with  those  computed  from  the  equivalent  circuit  selected  by  the  GA. 
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(b) 

Fig.7.  (a).  A  coaxial  transmission  line  structure. 

(b).  The  equivalent  circuit  of  the  coaxial  line  in  (a). 


Specified  GA 

C  =  0.0 1 0000  pF  C  =  0.0100098  pF 


Fig.  8.  Comparison  of  the  specified  S ,,  of  a  coaxial  line  with  that  computed  from 
the  equivalent  circuit  selected  by  the  GA. 
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Abstract 

This  paper  examines  several  optimization  techniques  as  they  are  applied  to  Yagi-Uda  antenna  desiga  Most  of  the 
techniques  discussed  are  various  forms  of  genetic  algorithms  (GAs).  A  gradient  method  is  also  used  both  alone  and  in 
conjunction  with  GAs  Statistics  are  gathered  as  each  method  is  ran  multiple  times  for  the  problem  of  a  five  element  array 
optimized  for  gain,  VSWR,  and  front-to-back  ratio  over  a  5%  bandwidth.  After  comparing  these  statistics,  the 
GA  +  Gradient  method  proved  to  be  the  most  efficient  of  the  methods  examined  It  also  produced  the  best  mean  antenna 
fitness  over  JO  optimization  runs. 

Introduction 

The  most  important  two  measures  of  an  optimization  technique  are  its  efficiency  and  ability  to  consistently  generate  good 
results.  As  long  as  the  first  of  these  is  reasonable,  the  latter  is  of  much  greater  importance.  Many  modem  optimization 
techniques,  such  as  genetic  algorithms  and  simulated  annealing,  are  stochastic  in  nature.  This  makes  it  difficult  if  not 
impossible  to  guarantee  the  quality  of  their  solution.  Gradient  methods,  on  the  other  hand,  are  efficient  techniques  which 
are  weU  known  to  converge  to  a  local  optimum.  Unfortunately  this  does  not  always  result  in  a  good  design  solution.  In 
highly  irregular  search  spaces  with  many  local  optima,  the  final  results  are  dependent  on  the  quality  of  the  supplied  initial 
guess.  This  short  coming  is  somewhat  ameliorated  by  running  multiple  optimizations  from  randomly  chosen  starting 
points  at  the  expense  of  longer  run  times. 

In  this  paper,  we  compare  several  variations  of  genetic  algorithms  and  a  gradient  method  by  applying  them  to  the  design  of 
a  five  element  Yagi-Uda  antenna  optimized  for  gain,  VSWR,  and  front-to-back  ratio  over  a  5%  bandwidth.  While  the 
performance  of  optimization  techniques  is  dependent  on  the  problem  being  solved,  this  study  provides  some  insight  into 
how  the  various  methods  will  perform  in  the  domain  of  antenna  design. 

Genetic  Algorithms 

Genetic  algorithms  are  an  optimization  technique  modeled  after  the  survival  of  the  fittest  paradigm  found  in  nature.  The 
theory  and  application  of  GAs  to  general  problems  is  well  described  in  the  literature  [1,  2].  Their  use  in  electromagnetics 
have  been  explored  in  [3, 4]  and  others.  Additionally,  [5]  provides  a  thorough  description  of  how  GAs  apply  to  the  design 
of  Yagi-Uda  antennas.  A  brief  overview  of  the  various  GAs  used  in  this  paper  follows. 

GAs  work  with  a  population  of  potential  design  solutions.  Individuals  in  the  population  are  given  a  fitness  score  based  on 
how  well  thej'  solve  the  design  problem.  A  probabilistic  selection  scheme  then  picks  individuals  out  of  the  population  with 
fitter  individuals  having  a  greater  chance  of  being  selected.  Various  genetic  operators  are  then  applied  to  the  selected 
individuals  to  produce  a  new  generation  of  designs.  These  operators  can  combine  information  from  two  designs  to  form  a 
new  individual,  randomly  mutate  part  of  an  individual  design,  or  simply  copy  an  individual  into  the  new  generation.  This 
process  of  evaluation,  selection,  and  then  applying  genetic  operators  is  carried  out  for  many  generations  until  the 
population  converges  to  a  single  solution  or  some  other  stopping  criteria  are  met 

The  abstract  representation  of  a  design  solution  in  a  GA  population  is  called  a  chromosome.  Chromosomes  can  have  a 
variety  of  structures.  For  parametric  optimization,  it  is  common  simply  to  concatenate  the  design  parameters  together  into 
a  string  of  numbers.  The  encoding  of  the  Yagi-Uda  antenna  shown  in  Figure  1  is  straight  forward.  The  variable  lengths 
and  spacings  of  the  array  elements  are  concatenated  together  to  form  a  string  of  numbers  as  shown  below. 
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Antenna  Chromosome  Representation:  L1L2S2L3S3L4S4L 

The  numeric  values  can  either  be  represented  as  binary  bit  strings  or  as  floating  point  numbers.  While  binary  encodings  are 
the  traditional  approach,  floating  point  representations  have  grown  in  popularity.  The  choice  of  encodings  does  not  affect 
most  implementation  issues  for  GAs,  but  it  does  affect  mutation  operators.  In  binary  bit  strings,  a  mutation  is  simply  a 
random  flipping  of  a  bit  Floating  point  numbers  are  mutated  by  adding  a  noise  term  to  their  value.  This  noise  term  can  be 
fabricated  a  variety  of  ways.  Choosing  from  a  Gaussian  distribution  centered  around  the  current  parameter  value  is  a  good 
choice. 


Figure  1:  Diagram  of  a  Yagi-Uda  array  with  the  length  and  spacing  parameters  labeled. 


Traditional  GAs  have  a  single  population  that  evolves  as  a  group.  A  variation  on  the  GA  theme  segments  the  population 
into  small  groups.  These  sub-populations  evolve  separately,  but  after  each  generation  a  few  individuals  from  each 
population  migrate  to  a  neighboring  population.  Using  sub-populations  along  with  migration  helps  encourage  population 
diversity  while  still  allowing  good  designs  to  spread  throughout  the  entire  population. 

The  Gradient  Method 

Unlike  genetic  algorithms  which  work  with  a  population  of  solutions,  gradient  methods  start  from  a  single  point  in  the 
design  space.  The  algorithm  evaluates  the  current  design  point  and  multiple  other  points  in  a  local  region  around  the 
current  point  to  determine  the  direction  of  steepest  accent  (or  decent  for  minimization)  The  design  point  is  then  moved  in 
this  direction  to  a  better  design  point  This  process  is  carried  out  until  a  local  maximum  is  found  such  that  the  gradient 
around  the  solution  becomes  zero. 

Gradient  methods  usually  converge  to  a  solution  very  quickly.  However,  the  quality  of  their  final  result  is  often  dependent 
on  the  quality  of  the  starting  point  To  insure  a  reasonable  solution  when  the  quality  of  the  starting  point  is  unknown,  it  is 
necessary  to  run  the  gradient  search  from  multiple  random  starting  locations  scattered  throughout  the  design  space.  For  the 
Yagi-Uda  antenna  design  problem,  searches  that  start  from  a  poor  initial  guess  usually  resulted  in  a  poor  final  solution.  In 
order  to  minimize  the  amount  of  time  spent  searching  poor  regions  of  the  search  space,  the  following  algorithm  was  used: 
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1.  Generate  P  random  antennas  with  parameters  between  the  limits  L lower  2nd  Lupper. 

2.  Pick  the  most  fit  antenna  from  1  and  use  it  as  the  starting  point  for  a  gradient  search. 

3.  Put  the  result  from  2  in  a  pool ,  B,  that  contains  the  final  solution  from  all  past  gradient  searches. 

4.  Narrow  the  search  space  by  centering  the  parameter  bounds,  Lfo»rr  and  Lupp(n  around  the  average  of  the  members  in  B 
weighted  by  their  score. 

R  =  Lupper  -  Llower 

RDew  =  z*R,  z<  1 


5. 


=  Center  - 


Lupper  Llower  Rnevi 

Here  R  is  the  parameter  range,  z  is  used  to  narrow  the  range  by  some  percentage,  and  FO 
function. 

Return  to  1  and  repeat  for  M  iterations. 


is  the  antenna  fitness 


Step  1  insures  the  gradient  search  starts  at  a  reasonable  location  in  the  design  space.  Steps  3  and  4  use  information  from 
past  evaluations  to  try  and  narrow  future  searches  into  a  more  promising  region  of  the  design  space. 


Gradient  optimization  can  also  be  used  in  conjunction  with  genetic  algorithms.  When  the  two  methods  are  combined,  the 
genetic  algorithm  is  run  for  a  smaller  than  usual  number  of  generations.  The  best  design  from  the  population  is  then  used 
as  the  starting  point  for  the  gradient  optimization.  This  approach  capitalizes  on  the  strengths  of  both  methods.  GAs  excel 
at  finding  good  regions  in  an  irregular  search  space  but  are  slow  to  converge  to  a  final  solution  while  gradient  methods 
work  well  at  searching  a  small  region  of  the  search  space  quickly. 


Results 

Five  different  search  methods  were  applied  to  the  problem  of  optimizing  a  five  element  Yagi-Uda  antenna  array.  The 
different  methods  are: 

1.  The  Gradient  search  algorithm  described  in  previous  section  (GRAD), 

2.  A  single  population  GA  with  binary  encoding  (GA-SB), 

3 .  A  single  population  GA  with  floating  point  encoding  (GA-SF), 

4.  A  multi-population  GA  with  floating  point  encoding  (GA-MF),  and 

5.  A  multi-population  GA  with  floating  point  encoding  +  gradient  optimization  (GA-MFG). 

For  the  gradient  method,  the  parameters  discussed  in  the  previous  section  are  as  follows:  P=50,  z^0.95,  and  N=10. 

The  settings  for  each  of  the  GAs  is  shown  in  Table  1.  Crossover  is  the  percentage  of  individuals  chosen  for  combination 
each  generation.  Mutation  is  the  percentage  of  bits  in  a  binary  encoding,  or  numbers  in  a  floating  point  encoding  chosen 
for  mutation.  Replacement  is  the  percentage  of  the  population  that  is  overwritten  with  new  individuals  each  generation 
Migration  is  the  number  of  individuals  that  move  to  a  neighboring  population  after  each  generation.  A  Sigma  Truncation 
Scaling  scheme  was  used  along  with  Stochastic  Random  Selection  during  the  selection  phase  of  the  GA.  These  parameters 
and  techniques  are  more  fully  discussed  in  [1]. 

All  but  the  last  GA  were  stopped  either  after  40  generations  or  when  the  standard  deviation  of  the  scores  in  a  population 
was  within  0.1%  of  the  average  population  score.  The  GA  +  Gradient  method  was  stopped  earlier  when  the  standard 
deviation  was  within  0.3%  of  the  average  fitness  at  which  point  the  gradient  method  took  over.  These  stopping  criteria 
generally  resulted  in  optimization  runs  that  required  between  5000  to  4000  function  evaluations. 
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GA  Type 

Number  of 
Pops. 

Pop. 

Size 

Crossover 

(%) 

Mutation 

(%) 

Replacement 

(%) 

Migration 

Convergence 
Stdev  (%) 

GA-SB 

1 

150 

80 

0.3 

80 

- 

0.1 

GA-SF 

1 

150 

80 

0.3 

80 

- 

0.1 

E2SZIBH 

5 

30 

80 

0.3 

80 

2 

0.1 

GA-MFG 

5 

30 

80 

0.3 

80 

2 

0.3 

Table  1:  Optimization  parameters  for  the  various  genetic  algorithms. 


The  antenna  was  optimized  for  gain,  VSWR,  and  front-to-back  ratio  over  a  5%  bandwidth  with  a  center  frequency  of 
300 MHz.  The  lengths  of  the  antenna  elements  were  allowed  to  vary  between  0.3/.  and  0.7k  and  the  spacings  varied 
between  0.15k  and  0.40k.  The  antennas  were  simulated  at  three  equally  spaced  frequencies  within  the  band  using  NEC4,  a 
Method  of  Moments  code.  [6]  The  function  used  to  evaluate  the  fitness  of  each  design  candidate  is  shown  below: 

Fitness  =  2 OG,^  - 1.5 VSWR^  + 1-5 FB^ 

Both  the  Gain,  G,  and  the  front-to-back  ratio,  FB,  are  specified  in  dB.  The  VSWR  was  calculated  assuming  a  50Q 
transmission  line  feed.  The  FB  ratio  was  calculated  by  dividing  the  forward  gain  by  the  maximum  gain  found  within  45  °  of 
the  backward  direction.  The  subscripts  min  and  max  indicate  the  minimum  or  maximum  value  for  a  parameter  over  the 
frequency  band.  Therefore,  this  equation  uses  the  worst  points  within  the  band  for  each  characteristic  to  calculate  the 
antenna  fitness. 

Each  of  the  optimization  methods  was  run  50  times.  The  results  shown  in  Table  2  list  the  average  number  of  antenna 
evaluations  per  run  along  with  statistics  on  the  final  antenna  scores.  The  ideal  optimization  algorithm  would  require  few 
evaluations  and  produce  a  high  average  score  with  a  small  standard  deviation.  If  the  standard  deviation  is  high,  a  larger 
number  of  optimization  runs  are  necessary  to  guarantee  good  results. 

For  antenna  optimization,  the  Method  of  Moments  antenna  simulations  are  by  far  the  most  computationally  intensive  part 
of  the  process.  This  is  so  much  the  case  that  comparing  the  number  of  antenna  evaluations  for  the  algorithms  is  a  reliable 
indicator  of  their  relative  efficiency.  To  ease  comparison,  the  speedup  of  each  algorithm  is  calculated  in  the  third  column 
using  GA-SF  as  the  baseline  method. 


Method 

Evals 

Speedup 

Mean 

Score 

Std. 

Dev. 

Min 

Score 

Max 

Score 

GRAD 

3211 

1.13 

188.03 

41.39 

102.9 

225.90 

GA-SB 

HES9HI 

■Hi am 

'7M 

GA-MF 

mWtM 

GA-MFG 

in 

1.66 

Table  2:  Statistics  for  the  various  optimization  methods  over  the  course  of  50  runs. 


Unfortunately,  the  enhancements  to  the  gradient  search  described  in  the  previous  section  did  not  reliably  produce  good 
antenna  designs.  Its  mean  score  is  far  lower  than  those  achieved  by  the  GAs.  Looking  at  the  genetic  algorithms,  the  single 
population  GAs  have  similar  mean  scores;  However,  the  floating  point  encoded  GA  converges  to  a  solution  much  faster 
than  the  binary  GA.  If  the  binary  GA  is  stopped  after  roughly  3500  iterations,  its  average  score  is  around  218  -  much 
lower  than  the  floating  point  method  for  the  same  number  of  evaluations.  Switching  from  the  single  population  GA  to  a 
multi-population  improves  the  mean  slightly  at  the  expense  of  more  evaluations.  The  multi-population  GA  +  Gradient 
(GA-MFG)  provides  both  the  best  mean  score  and  efficiency.  The  improved  efficiency  comes  from  stopping  the  GA 
sooner  than  normal  and  allowing  the  gradient  method  to  quickly  fine  tune  the  solution.  Table  3  lists  the  final  lengths  and 
spacings  for  an  approximately  average  antenna,  score  =  228.37,  and  the  best  antenna,  score  =  230.94,  found  by  the  GA- 
MFG  search  over  the  50  runs.  Graphs  of  the  VSWR,  gain,  and  F/B  ratio  are  shown  for  both  antennas  in  Figure  2.  While 
the  best  antenna  has  a  better  VSWR  and  F/B  ratio,  for  applications  where  gain  is  the  most  important  feature,  the  average 
antenna  might  be  a  better  choice. 
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Table  3:  The  lengths  and  spacings  for  an  approximately  average  and  the  best  antenna  design  generated  by  the  GA- 
MFG  over  the  course  of  SO  runs.  The  minimum  performance  numbers  across  the  band  are  also  shown  for 
gain,  vswr,  and  front-to-back  ratio. 

Notice  that  all  three  floating  point  GAs  produced  similar  maximum  scores  over  the  course  of  50  runs.  This  suggests  that 
any  of  these  methods  will  produce  excellent  results  given  enough  runs.  For  this  five  element  problem,  the  average  run  time 
for  a  GA-MFG  optimization  was  around  seven  minutes  on  a  300  MHz  Pentium  II  CPU.  It  is  always  a  good  idea  to  make 
several  optimization  runs  when  using  GAs,  however  5-10  runs  are  probably  more  reasonable  than  50  runs  --  especially  for 
larger  arrays  that  are  more  computationally  intensive  to  simulate.  The  large  number  of  runs  used  for  this  paper  was 
necessary  to  obtain  accurate  statistics. 

Conclusion 

This  study  has  examined  several  optimization  techniques  for  the  design  of  a  five  element  Yagi-Uda  antenna  array.  The 
GAs  proved  to  be  more  effective  at  producing  good  results  than  the  gradient  method  used.  Of  the  GAs,  floating  point 
encodings  faired  better  than  binary  encodings  and  multi-population  techniques  were  arguably  better  than  single  population 
techniques.  Finally,  the  multi-population  GA  +  Gradient  method  proved  to  be  the  best  approach  both  in  terms  of  efficiency 
and  consistent  production  of  good  antenna  designs. 
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Figure  2:  These  three  graphs  gain,  VSWR,  and  front-to-back  ratio  performance  an  average  antenna  (dashed)  and 
the  best  antenna  (solid)  generated  from  SO  runs  of  the  GA-MFB  optimization  method  across  the 
frequency  band. 
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Abstract 

A  systematic  approach  for  designing  broadband  low-sidelobe  linear  and  planar  antenna  arrays  by  varying 
interelement  spacings  has  not  been  available.  Both  the  large  number  of  possible  spacing  combinations  along 
with  theoretical  limitations  of  the  problem  have  made  it  difficult  to  formulate  robust  broadband  design 
techniques.  The  genetic  algorithm  (GA)  has  recently  proven  to  be  a  very  effective  design  tool  for  nonuniformly 
spaced  low  sidelobe  antenna  arrays  with  uniform  excitation  intended  for  operation  at  a  single  frequency  [5], 
This  paper  deals  with  a  generalization  of  previous  GA  techniques  to  include  the  design  of  low  sidelobe  arrays 
that  are  operable  over  a  band  of  frequencies.  It  will  be  shown  that  the  GA  objective  function  is  no  more 
complicated  to  evaluate  for  broad-banding  purposes  than  it  is  in  the  single  frequency  case.  Several  examples 
of  GA-designed  broadband  low  sidelobe  arrays  will  be  presented  and  discussed. 


1.  Introduction 

In  recent  years,  genetic  algorithms  have  found  a  fairly  strong  presence  in  electromagentics  optimization 
problems  involving  antenna  design.  The  difficulty  in  solving  many  antenna  design  problems  is  that  very  often 
there  are  many  parameters  and  no  practical  analytical  methods  available  to  optimally  determine  them.  Such 
difficulties  make  efficient  search  strategies,  like  genetic  algorithms,  very  important.  The  main  advantages  of 
using  the  GA  over  other  search  strategies  are:  1)  the  GA  can  search  from  any  number  of  random  points  to 
find  a  solution,  2)  the  GA  works  with  a  coding  of  the  parameters  and  not  the  actual  parameters,  3)  GA’s  use 
random,  not  deterministic,  transition  rules,  and  4)  the  GA  does  not  require  the  evaluation  of  derivatives  [1]. 
Among  the  first  applications  of  genetic  algorithms  in  antenna  design  was  the  thinning  of  large  arrays  [1]. 
Some  other  varieties  of  antenna  arTays  to  which  the  GA  has  been  applied  include  planar  arrays  [1 ,2),  multiple 
beam  arrays  [3],  and  Yagi-Uda  arrays  [4],  There  have  also  been  several  excellent  review  articles  written  about 
GA’s  and  their  application  to  solving  complex  engineering  electromagnetics  problems  [5-7]. 

The  capability  of  GA’s  to  produce  optimal  low  sidelobe  designs  for  linear  arrays  of  uniformly  excited  isotropic 
sources  (at  a  single  frequency)  by  allowing  only  the  interelement  spacings  to  vary  was  first  demonstrated  in 
[5],  Interelement  spacings  were  decided  by  using  a  3  bit  parameter  such  that  they  could  vary  in  increments 
of  X/8  with  a  minimum  interelement  spacing  of  7J4.  In  this  paper,  we  will  demonstrate  that  GAs  are  also  an 
extremely  useful  tool  for  broad-banding  of  uniformly  excited,  unequally  spaced  antenna  arrays.  There  are  two 
major  advantages  of  the  technique  employed  in  this  paper  when  compared  to  previously  published  methods, 
such  as  those  described  in  [5].  These  advantages  are  that  1)  a  much  finer  discretization  <=  ±  0.01  A )  will  be 
used,  and  2)  the  GA-designed  arrays  will  have  minimal  sidelobes  over  a  band  of  frequencies  instead  of  at  just 
one  frequency.  The  steady-state  genetic  algorithm  with  uniform  crossover  [8]  was  chosen  for  use  in  optimizing 
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the  array  designs  discussed  in  this  paper. 

Although  many  traditional  analytical  techniques  exist  for  placing  elements  in  unequally  spaced  arrays  for 
broad-banding  purposes,  viz.  [9-11],  none  of  these  methods  are  capable  of  producing  significantly  low 
sidelobe  levels  over  the  entire  band.  The  focus  of  many  of  these  methods  is  to  place  elements  in  an  array 
such  that  the  minimum  separation  between  elements  is  greater  than  or  even  much  greater  than  a  wavelength. 
The  advantage  of  such  large  interelement  spacings  is  that  a  larger  bandwidth  can  be  achieved  because  a 
lower  minimum  frequency  is  possible.  The  disadvantage,  however,  is  that  a  theoretical  lower  bound  exists 
on  the  sidelobe  level  when  average  interelement  separations  exceed  a  wavelength  [12].  This  theoretical 
minimum  is  usually  not  low  enough  for  practical  applications.  Keeping  in  mind  this  theoretical  limitation,  a 
design  optimization  technique  will  be  introduced  in  this  paper  which  attempts  to  place  elements  such  that  the 
average  interelement  spacing  in  the  array  is  always  less  than  a  wavelength. 

Another  important  consideration  in  the  design  of  antenna  arrays  is  their  steerability.  Broad-band  arrays  have 
the  properly  that  they  may  exhibit  perfect  steerability  at  lower  frequencies  of  operation,  but  steerability  is 
reduced  when  moving  to  higher  frequencies.  The  fact  that  steerability  changes  with  frequency  can  be 
quantified  by  the  bandwidth-steerability  product  of  the  array  [12]. 

A  useful  conversion  factor  will  be  introduced  in  Section  2  that  permits  design  tradeoffs  to  be  made  between 
bandwidth  and  (minimum)  element  separation.  Steerability  issues  will  also  be  briefly  discussed  in  Section  2. 
Sections  3  and  4  present  and  discuss  four  examples  of  genetic  algorithm  produced  broadband  low  sidelobe 
array  designs  -  two  linear  arrays  (Section  3)  and  two  planar  arrays  (Section  4).  In  addition,  the  GA  objective 
function  used  to  produce  each  design  is  given  in  the  respective  sections.  All  array  designs  considered  in  this 
paper  were  specified  to  have  a  maximum  possible  bandwidth  with  a  minimum  element  separation  of  A/4  and 
the  lowest  possible  sidelobe  level  throughout  the  band. 

2.  Some  Considerations  for  Broad-Banding  Arrays 


In  designing  a  broadband  array  for  low  sidelobe  performance,  it  is  sufficient  to  design  for  the  highest  desired 
frequency  of  operation  f2 .  Having  done  this,  the  frequency  may  then  be  varied  from  f2  to  any  f ,  provided 
/<  f1%  without  the  appearance  of  any  higher  sidelobes.  The  bandwidth  for  such  an  array  is  defined  to  be 
B  =  f2/fl.  Furthermore,  we  note  that  if  a  minimum  separation  between  two  elements  exists  at  the  lowest 
design  frequency  _/j  that  is  considered  too  small  for  practical  purposes,  then  that  spacing  can  be  made  larger 
at  the  expense  of  a  smaller  bandwidth  B'  <  B  (i.e.,  f2  <  f2 ).  This  property  is  best  illustrated  by  the  following 
transformation: 


(1) 


where 


B'=f;/fi 
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s  =  the  set  of  original  element  locations  {s„  -dn/ 1,  :  n  =  1,2,...,  N} 
s'  =  the  set  of  new  element  locations  {s'  =  d'nXx  :n  =  l,2,...,N\ 

Hence,  the  array  configuration  need  only  be  optimized  for  a  desired  maximum  bandwidth  B,  subject  to  some 
specified  tolerance  on  the  minimum  element  separation.  Once  this  optimal  array  design  has  been  found  using 
the  GA  then,  if  desired,  the  transformation  given  in  (1)  may  be  employed  to  find  modified  designs  which 
tradeoff  larger  element  separations  for  smaller  operating  bandwidths. 

Another  notable  characteristic  of  broadband  arrays  is  how  steerability  is  affected  with  increasing  bandwidth. 
It  can  be  shown  that  the  bandwith  and  steerability  of  a  linear  array  are  related  by  the  following  formula,  which 
is  known  as  the  bandwidth-steerability  product  [11]: 

B(l+C (2) 


where 


B  =  bandwidth 
90  =  steering  angle 

w0  =  the  maximum  value  of  (dme  I X) cos 8  that  can  be  used  before  a  sidelobe  will  exceed  the 
desired  sidelobe  level 

dme  =  average  interelement  separation  in  the  array 
^min =  smalfest  interelement  separation  in  the  array 

The  right-hand  side  of  this  equation  is  a  constant,  and  is  characteristic  of  the  individual  array.  Note  that  at  a 
1-1  bandwidth,  the  right-hand  side  of  the  equation  must  be  at  least  two  to  guarantee  perfect  steerability.  When 
a  bandwidth  of  larger  than  1:1  is  desired,  the  left-hand  side  of  this  equation  limits  steerability  at  some  of  the 
higher  frequencies  in  the  band.  Thus,  while  a  broadband  array  may  exhibit  perfect  steerability  at  lower 
frequencies  of  operation,  steerability  may  become  limited  at  higher  frequencies  of  operation,  in  addition, 
arrays  designed  to  operate  at  only  one  frequency  when  interelement  spacings  are  small  may  not  exhibit  any 
steerability. 

3.  Linear  Broadband  Array  Designs 

The  array  factor  expression  for  the  far-field  radiation  pattern  of  a  linear  array  of  isotropic  sources  can  be 
written  in  the  following  form: 
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(3) 


AF{0)  =  2^1  n  cos  [2x(f  If, )dn  cos0  +  a„] 

n=\ 

where 

an  =  -2.k {f  /  fx  )dn  cos  0O  (4) 

and 

2N  -  the  total  number  of  elements  in  the  array 

I„  =  excitation  current  amplitude  of  the  nth  element  in  the  array 

an  =  excitation  current  phase  of  the  nth  element  in  the  array 

sn~dny lj  =  total  distance  of  the  nth  element  from  the  axis 

6 =  angle  measured  from  the  line  passing  through  antenna  elements 

0O  =  steering  angle 

yj  =  base  (minimum)  frequency  of  operation 
/  =  desired  frequency  of  operation 

The  objective  function  used  by  the  GA  in  this  paper  is  based  on  the  array  factor  expression  given  in  (3),  where 
the  desired  goal  is  to  minimize  the  maximum  sidelobe  level  of  the  array  over  some  prescribed  bandwidth.  The 
form  of  this  objective  function  is 


F(0)  =  max 


cos[2tz2W„  cos  (9] 
AF^tf) 


where 


(5) 


AF^  (0)  -  peak  of  the  main  beam  (for  normalization) 

B-f2/f1=  desired  bandwidth  of  the  array  ( f2  >  f ) 

The  parameters  dn  were  selected  by  the  GA  to  minimize  the  maximum  sidelobe  level  with  In  set  to  unity  for 
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all  values  of  n.  The  discretization  of  dn  was  made  relatively  fine,  such  that  it  could  be  varied  in  increments 
of  approximately  ±0.01  between  zero  and  some  maximum  selected  value.  The  use  of  any  finer  discretization 
was  found  to  yield  little  improvement  in  the  overall  results.  It  was  also  found  that,  in  the  case  of  broadband 
array  optimization  (B>1),  the  GA  objective  function  need  not  be  any  more  complicated  to  evaluate  than  it  is 
for  optimization  of  array  performance  at  a  single  frequency  ( B=1 ).  This  is  one  of  the  attractive  features  of  the 
technique  presented  here,  since  it  means  that  the  overall  design  optimization  time  required  by  the  GA  will  be 
essentially  the  same  regardless  of  whether  single-frequency  or  broadband  array  configurations  are  being 
considered. 

The  first  broadband  design  that  will  be  considered  is  a  uniformly  excited  40  element  array  with  a  minimum 
element  separation  requirement  of  a  quarter-wavelength  at  the  lowest  design  frequency  /  =  /,,  i.e., 
An  ~d„)>  0.25  V n=l,2,...,N-l.  In  this  case,  the  GA  was  able  to  optimize  interelement 

spacings  in  the  40  element  array  so  that  a  bandwidth  of  B  =  3.5  was  possible  for  broadside  operation  with  a 
maximum  sidelobe  level  of -19.41dB  throughout  the  entire  band  (see  Table  1).  Figures  la-lc  show  plots 
of  the  array  factor  at  the  low-band  ( f  =  ),  mid-band  =  +  fi ) /  2 ),  and  high-band  ( /  =  /2 )  design 

frequencies.  Furthermore,  Figures  2a-2c  show  the  optimized  40  element  array  with  the  main-beam  steered 
to  60°  for  the  same  three  frequencies  considered  in  Figures  la-lc.  These  figures  demonstrate  that,  for  the 
low-band  (/  =  f )  and  mid-band  (/  =  2.25/j )  frequencies,  it  is  possible  to  steer  the  main-beam  to  60° 
without  any  increase  in  the  synthesized  sidelobe  level.  However,  for  the  high-band  frequency  (f  —  3.5/j ), 
we  see  that  the  synthesized  sidelobe  level  can  no  longer  be  maintained  when  the  beam  is  steered  to  60°. 
Further  investigation  reveals  that  the  steerability  of  this  array  at  /  =  /2  =  3.5/j  is,  as  predicted  by  (2),  almost 
negligible.  It  was  found  that  the  maximum  frequency  at  which  this  array  exhibits  perfect  steerability  is 
/  =  1 .75/j . 

Larger  sized  arrays  were  found  to  be  capable  of  producing  wider  bandwidths.  For  example,  using  A,  /4  as 
the  minimum  interelement  separation  for  a  100  element  array,  it  was  possible  to  optimize  the  array 
configuration  using  the  GA  to  yield  a  bandwidth  of  B  =  3.97  and  a  maximum  sidelobe  level  of  -20.32  dB  (see 
Figure  3  and  Table  1).  The  maximum  frequency  at  which  this  array  is  steerable  to  60°  is  at  f=2.64f1,  and  the 
maximum  frequency  at  which  this  array  exhibits  perfect  steerability  is  at  f  —  1.98fi  (see  Figure  5).  If  larger 
interelement  spacings  are  desired,  then  (1)  may  be  used  to  determine  the  corresponding  reduction  in 
bandwidth  that  would  result.  For  instance,  increasing  the  A,/  4  minimum  interelement  separation  to 
0.49625  A,  reduces  the  bandwidth  from  3.97  down  to  2.  On  the  other  hand,  the  bandwidth  of  the  array  can 
be  doubled  to  B  =  7.94  by  allowing  the  minimum  element  separations  to  be  as  small  as  A,  /  8 .  Reducing  the 
minimum  element  separation  to  Aj  /8  also  doubles  the  bandwidth  over  which  the  array  is  steerable  -  i.e.,  in 
this  case  the  above  array  would  be  perfectly  steerable  over  a  bandwidth  of  B  =  3.96  with  a  maximum  sidelobe 
level  of -20.32  dB. 

4.  Planar  Broadband  Array  Designs 

The  GA  optimization  procedure  described  in  the  previous  section  for  broadbanding  linear  arrays  will  be 
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generalized  in  this  section  to  include  planar  array  configurations.  In  particular,  the  GA  design  approach  will 
be  developed  for  rectangular  arrays  as  well  as  for  concentric  circular  arrays  with  variable  element  spacings. 
The  array  factor  for  a  non-uniformly  spaced  rectangular  array  of  isotropic  sources  may  be  represented  in  the 
following  form: 

AF(9,P)  =  cos[2;zz4,  (///,)  sin  6  sin  (f\zos\2nd ym  (///,)  sin  0  cos  (6) 

n=l  m=l 


where 


2M  =  total  number  of  elements  in  the  y-direction 
2N  =  total  number  of  elements  in  the  x-direction 
sm  =  dxnA j  =  element  locations  in  the  x-direction 


sym  =  =  element  locations  in  the  y-direction 

The  corresponding  objective  function  in  this  case  is  given  by 

X  it,  cos  [2nd  „  B  sin  6  sin  <f>]  cos[2jvd  vm  B  sin  6  cos  6] 

F(9J)  =  max42Z4n -  - 

n=l  m=l  ^max( &*9) 


(7) 


The  GA  uses  (7)  to  determine  the  set  of  parameters  dm  and  dym  that  yields  the  lowest  possible  sidelobe 
level  over  a  specified  bandwidth  B,  assuming  that  the  array  is  uniformly  excited  (i.e.,  lmn  =  1  for  all  values  of 
m  and  n).  in  order  to  accomplish  this,  a  spacing  scheme  was  designed  such  that  rows  and  columns  were 
treated  the  same.  For  example,  the  GA  selects  a  set  of  spacings  s  =  \d± ,  d2 ,  d3 , . . . ,  dn } ,  where  n  is  the 
number  of  rows  and  columns  in  the  array  (i.e.,  N  ~M  =  «  where  N  and  M  are  from  (6)).  The  number 
djt  1  <>i<n,  then  represents  the  interelement  spacing  between  elements  (i,j)  and  (i - l,y)  and  the 
elements  (/,;')  and  (y,z-l)  V/ a  1  <  j  <n,  where  the  indices  (0,j)  and  (j, 0)  are  the  y  and  x  axes 
respectively.  This  scheme  makes  the  objective  function  very  simple  to  evaluate  because  the  maximum 
sidelobe  level  will  always  be  located  in  the  ^  =  0°  and  <j>  =  90°  planes.  Figure  4a  and  Figure  4b  show 
radiation  pattern  cuts  at  <j>  =  0°  and  =  45° ,  respectively,  for  a  4,096  element  array  which  was  designed 
to  produced  a  bandwidth  of  B-  3.5  and  a  maximum  sidelobe  level  of-19.41dB  with  a  minimum  specified 
element  separation  of  Xy  I  A. 

Next  we  will  consider  an  alternative  design  optimization  approach  based  on  concentric  circular  arrays  which 
results  in  a  more  spatially  uniform  distribution  of  sidelobes.  The  objective  function  in  this  case  was  chosen 
to  be 


353 


F(6,<f))  =  maxi 


M  Nm 

X  2  A™  exp[/2jrftom  sin  <9  cos(0  -  )  +  jam 

r=l  n=1 


^Fmax(^^) 


where 


M  W, 

4(F(0,  ^)  =  X  2  exp[/2^(/  /  /  )am  sin  0  cos(#  -  0mn )  +  jamn  ]  (9) 

m=l  «=1 

ormn  =  -2x{f  /  /, )  sin  0O  cos(&  -  ^  )  (10) 


and 


rOT  =  am\  =  radius  of  the  mfri  ring  array 
M  =  total  number  of  concentric  ring  arrays 
Nm  =  total  number  of  elements  in  the  mfri  ring 

The  spacing  scheme  was  designed  such  that  elements  were  placed  on  arcs  spaced  \  /  4  apart,  where  A, 
corresponds  to  the  wavelength  at  the  lowest  design  frequency  .  In  addition  to  this,  the  elements  in  each 
quadrant  were  assumed  to  be  arranged  symmetrically  about  their  respective  diagonal  axis  (e.g.,  the  elements 
in  the  first  quadrant  are  symmetric  with  respect  to  the  ^  =  45°  axis).  Figure  5  shows  one  quadrant  of  a 
concentric  circular  array  in  the  equally  spaced  case  that  could  be  constructed  using  this  spacing  scheme.  For 
this  example,  the  minimum  arc  length  between  any  two  consecutive  array  elements  was  set  to  /4.  Figure 
6  shows  the  radiation  pattern  produced  over  the  <f>  =  45°  cut  by  a  308  element  concentric  circular  array  with 
element  spacings  optimized  to  yield  a  bandwidth  of  B  =  3.5  with  a  maximum  sidelobe  level  of  — 21.91dB 
throughout  the  band. 

5.  Conclusions 

Uniformly  excited  array  broad-banding  has  been  achieved  using  a  genetic  algorithm  optimization  procedure 
with  bandwidths  as  large  as  B  =  3.97  for  linear  arrays  and  B  =  3.5  for  planar  arrays  with  minimum  element 
separation  of  A/4.  Minimum  element  separation  can  easily  be  made  larger  to  avoid  mutual  coupling  effects, 
or  it  can  be  made  smaller  to  increase  bandwidth  by  using  the  convenient  conversion  factor  given  in  (1).  Array 
steerability  issues  have  also  been  addressed  in  this  paper.  Steerability  varies  with  operation  frequency  as 
predicted  by  (2)  -  it  is  greater  at  lower  frequencies  of  operation  and  lesser  at  higher  frequencies  of  operation. 
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In  addition,  the  bandwidth  over  which  the  array  is  steerable  improves  proportionally  as  (1)  is  used  to  increase 

bandwidth. 
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theta 


Figures  la-lc.  Uniformly  excited,  80  element,  isotropic, 
linear  array  factor  plotted  at  broadside:  1(a)  f  j  f-  1. 
1(b)  ///]=  2.25,  and  1(c)  fjf  =  3.5.  Maximum 
bandwidth  is  /2  j  f  5  =  3.5. 
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(d,Ai) 
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Number 

(n) 

Element 

Separation 

(dA.)  .  _ 

Element 

Number 

(n) 

Element 
Separation 
(d  Jh) 

1 

0.125 

6 

0.25 

11 

0.25 

16 

0.26 

2 

0.25 

7 

0.25 

12 

0.26 

17 

0.26 

3 

0.25 

8 

0.25 

13 

0.25 

18 

0.26 

4 

0.25 

9 

0.25 

14 

0.30 

19 

1.15 

5 

0.25 

10 

0.25 

15 

0.26 

20 

0.86 

Table  1.  Element  separations  at  f  for  the  array  depicted  in  Figures  la-lc. 
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Figure  3.  Uniformly  excited,  100  element,  isotropic,  linear 
array  plotted  at  f  j  fy=  3.97. 


Figures  4a-4b.  Radiation  pattern  cuts  at  <j)  =  0°  (Figure 
4a)  and  <j)  =  45°  (Figure  4b)  of  a  4,096  element  square 
planar  array,  f j  f\~  3.5 
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Figure  5.  One  quadrant  of  an  equally  spaced  concentric 
circular  arc  array  that  is  arranged  symmetrically  about  its 
diagonal  axis. 


Figure  6.  Broad-band  concentric  circular  arc  array 

<P  =  45° .  ///,-  3.5 
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Abstract 

The  best  size  of  the  population  and  percentage  of  mutation  of  phase-only  adaptive  nulling 
and  amplitude  &  phase  adaptive  nulling  with  a  genetic  algorithm  (GA)  are  found  based  on  the 
null  depths  of  transient  and  steady  state  responds  of  adaptive  nulling  since  the  convergence  speed 
of  a  GA  and  the  final  solution  of  a  GA  depend  on  the  size  of  population,  percentage  of  mutation 
and  the  other  parameters  in  genetic  algorithms.  The  null  depths  of  phase-only  and  amplitude  & 
phase  adaptive  32  elements  half  wavelength  spaced  linear  arrays  with  a  GA  are  compared  when 
single  interference  incidents  on  an  array.  All  simulation  results  are  averaged  over  30  runs  after 
25  generations. 


1.  Introduction 

The  genetic  algorithm  (GA)  parameters:  size  of  population,  discard  rate  and  percentage 
of  mutation  have  been  researched  to  optimize  the  GA  performance  [1  -  10].  Increasing  the 
number  of  mutations  increases  the  GA’s  ability  to  search  outside  of  current  convergence  area.  In 
addition,  a  large  population  generates  more  offspring  to  search  a  wider  area.  Therefore,  the 
performance  of  a  GA  depends  on  the  size  of  population,  percentage  of  mutation,  and  the  other 
parameters  in  genetic  algorithms  within  a  certain  generation.  Adaptive  nulling  with  a  GA 
requires  generating  multiple  deep  nulls  very  quickly  so  that  the  best  or  proper  mutation  rate  and 
population  size  are  required  to  optimize  the  adaptive  nulling  with  a  GA.  One  author  has 
suggested  a  mutation  rate  of  1/L  where  L  is  number  of  bits  in  a  chromosome  [8]. 

The  best  population  size  and  percentage  of  mutation  will  be  found  for  3  bit  phase-only 
adaptive  nulling  and  5  bit  amplitude  &  3  bit  phase  adaptive  nulling  with  a  GA  in  terms  of  the 
null  depths  of  transient  and  steady  state  responds  of  adaptive  nulling.  The  3  bit  phase-only 
adaptive  nulling  with  a  GA  outperforms  any  other  bit  setup  phase-only  with  a  GA,  and  5  bit 
amplitude  and  3  bit  phase  adaptive  nulling  with  a  GA  outperforms  any  other  phase-only  and 
amplitude  &  phase  adaptive  nulling  with  a  GA  [11].  Multiple  interference  sources  can  be  non- 
symmetric,  symmetric  or  closely  spaces  in  angle.  In  this  paper,  only  results  for  a  single 
interference  at  109  degrees,  that  is  larger  than  10000  times  of  desired  signal,  are  shown.  The 
population  size  (chromosomes)  is  half  of  initial  population  when  the  discard  rate  is  fixed  as  0.5. 
The  population  is  varied  from  10  to  42,  and  mutation  rate  is  varied  from  0.1%  to  5%  since  a 
population  of  42  is  good  enough  to  place  nulls,  and  lager  than  5%  and  smaller  than  0.1% 
mutation  rate  give  worse  results  than  0.1  ~  5%  mutation  rate  for  single  interference  case.  All  the 
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results  of  different  population  size  and  mutation  rates  are  averaged  over  30  runs  after  25 
generations. 

2.  Linear  Array  Elements  and  Interference 

A  linear  array  using  amplitude  and  phase  adaptive  nulling  with  a  GA  is  shown  in  Figure 
1.  The  GA  controls  the  least  significant  bits  of  the  phase  shifter  bits  and  amplitude  weight  bits, 
and  minimizes  output  power  while  it  maintains  the  main  beam  direction  and  gain  [10,  1 1].  The 
array  pattern  of  half  wavelength  spaced  32  linear  elements  array  with  initial  ~30dB  Chebyshev 
amplitude  taper  and  single  possible  interference  are  shown  in  Figure  2. 

The  linear  array  pattern  is  given  by 

AF{0)  =  2]T  an  cos[(rc  - l)*?  +  A„  +  S„  ].  (!) 

n=l 

aj,  array  weight  at  element  n;  'F  =  kdu-, 

k  =  271 1  X\  a=  wavelength; 

d  =  spacing  between  elements;  0  =  angle  in  degree; 

Sn  =  nulling  phase;  A„=  beam-steering  phase; 

<])  =  angle  of  incidence  of  electromagnetic  plane  wave; 

The  amplitudes  are  even  symmetric,  and  the  phase  is  odd  symmetric  [2, 10,  1 1]. 


incident 

field 


Fig.  1 .  Diagram  of  an  amplitude  and  phase  adaptive  linear  array. 
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Fig.  2.  Initial  array  pattern  with  single  interference  location,  109  degree. 

3.  Results 

The  results  of  phase-only  and  amplitude  &  phase  adaptive  nulling  for  an  interference  at 
109  degrees  with  a  GA  based  on  different  size  of  population  and  mutation  rate  are  shown  in 
Figures  3  to  8.  All  the  results  are  averaged  over  30  runs  after  25  generation  on  each  population 
and  mutation  rate. 

The  null  depth  in  dB  of  phase-only  and  amplitude  &  phase  adaptive  nulling  with  a  GA 
based  on  population  and  mutation  rate  are  shown  in  Figures  3  and  4,  respectively.  Figures  3  and 
4  show  that  null  depth  of  amplitude  &  phase  is  deeper  than  null  depth  of  phase-only  after  25 
generations.  The  best  mutation  rate  of  phase-only  adaptive  nulling  is  from  2%  to  5%  with  42 
chromosomes  in  the  population,  and  the  range  of  the  best  mutation  becomes  smaller  when 
population  size  is  reduced.  Figure  4,  amplitude  &  phase,  shows  from  1%  to  3%  mutation  with 
42  population  gives  the  best  null  depth  after  25  generations.  When  the  population  size  is  reduced 
to  20,  the  best  mutation  rate  range  is  reduced  to  1  ~  2.2%  (figure  4).  The  proper  mutation  rate  of 
amplitude  &  phase  adaptive  nulling  is  smaller  than  that  of  phase-only  since  the  recommended 
mutation  rate  is  1/L  where  UmPiitude& phase  =  128  and  Lphase-oniy  =  48. 

Figures  5  and  6  show  transient  responds  of  phase-only  and  amplitude  &  phase  adaptive 
nulling  with  a  GA.  Figures  5  and  6  show  that  phase-only  and  amplitude  &  phase  adaptive 
nulling  produce  almost  the  same  null  depth  up  to  -70dB  even  though  the  mutation  rate  is 
changed.  In  addition,  the  amplitude  &  phase  nulling  generates  deeper  nulls  quickly  and  the  null 
depth  is  improved  constantly  while  the  null  depth  by  phase-only  are  not  improved  after  -70dB. 
Both  Figures  5  and  6  indicate  that  below  0.8%  and  above  5%  mutation  rates  are  not 
recommended  since  the  results  of  both  range  of  mutation  rates  do  not  give  good  transient  and 
steady  state  responds. 

Figures  7  and  8  show  transient  null  depth  in  dB  is  based  on  different  population  with  the 
best  mutation  rate.  The  amplitude  &  phase  adaptive  nulling  with  a  larger  population,  42,  gives 
better  transient  and  steady  state  responds  than  any  others.  The  phase-only  and  amplitude  & 
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phase  adaptive  nulling  with  a  genetic  algorithm  generate  deep  -60dB  nulls  in  5  generations 
except  phase-only  with  10  and  14  chromosome  in  the  population.  Generally,  the  -50dB  nulls  are 
achieved  at  the  first  generation  with  more  than  14  chromosome  in  the  population  in  both  phase- 
only  and  amplitude  and  phase  adaptive  nulling  with  a  GA. 

4.  Conclusion 

The  best  values  for  the  proper  size  of  population  and  mutation  rate  of  phase-only  and 
amplitude  &  phase  adaptive  nulling  with  a  genetic  algorithm  are  found  when  a  single 
interference  is  incident  on  a  32  element  half  wavelength  spaced  linear  array.  The  null  depths  of 
each  population  size  and  mutation  rate  are  averaged  over  30  independent  runs  after  25 
generations. 

The  population  should  be  over  14  chromosome  to  generate  -50dB  nulls  at  the  first 
generation,  and  phase-only  adaptive  nulling  with  a  GA  requires  higher  mutation  rate  than 
amplitude  &  phase  adaptive  nulling  with  a  GA  since  the  number  of  bits  in  a  chromosome  of 
phase-only  is  less  than  that  of  amplitude  &  phase.  The  best  mutation  rate  should  be  larger  than 
0.8%  and  smaller  than  5%,  and  the  best  mutation  rate  should  be  chosen  based  on  the  population 
size. 

Generally,  the  phase-only  and  amplitude  &  phase  adaptive  nulling  with  a  genetic 
algorithm  generate  deep  nulls  quickly,  and  amplitude  &  phase  adaptive  nulling  generates  deeper 
null  than  phase-only  adaptive  nulling  with  a  GA. 
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Abstract —  Mutual  coupling  among  array  elements  would  invariably  distort  the  desired  pattern  shape  of  an 
array  designed  with  ideal  elements.  A  genetic  algorithm  (GA)  is  proposed  to  compensate  for  the  mutual 
coupling  effects  in  array  antenna  design.  In  this  study,  the  Numerical  Electromagnetic  Code  (NEC)  is 
worked  together  with  the  GA  optimizer  that  employs  no  binary  coding  and  decoding.  Near-solutions  from 
other  analytic  or  heuristic  techniques  may  be  injected  into  the  initial  population  to  speed  up  convergence. 
Numerical  examples  of  circular  arrays  are  presented  to  show  the  effectiveness  of  the  approach. 


I.  Introduction 

The  gain  of  an  array  is  inextricably  related  to  the  gain  of  its  individual  elements  [1].  However,  the  gain  of  an 
isolated  element  may  be  very  different  from  that  of  the  same  element  in  the  presence  of  the  rest  of  the  array.  In 
addition,  the  element  patterns  and  gain  vary  across  the  array  with  the  elements  near  the  edge  behaving  quite 
unlike  those  near  the  center.  This  behavior  is  due  to  the  electromagnetic  coupling  among  elements.  It  can  result 
in  more  or  less  element  gain  in  the  array  environment  than  when  isolated. 

Many  iterative  algorithms  exist  to  combat  the  mutual  coupling  effects  and  numerous  studies  have  been  conducted 
to  understand  its  effects  [2]-[6].  In  any  finite  array,  the  designated  currents  and  fields  differ  from  element  to  element 
in  magnitude,  phase  and  distribution.  These  differences  vary  as  a  function  of  frequency  and  array  beam  scanning 
angle.  In  practice,  this  complex  interaction  can  be  evaluated  by  writing  the  radiated  field  from  all  elements  as 
generalized  integrals  that  include  current  distributions  over  the  surface  of  radiating  elements  and  nearby  diffracting 
bodies,  such  as  through  the  use  of  Numerical  Electromagnetic  Code  (NEC). 

The  NEC  program  is  embedded,  as  part  of  the  GA  loop,  within  the  fitness  evaluation  function  which  provides  it 
with  the  relevant  NEC  codes  to  simulate  the  realistic  impedance  and  pattern  formation.  From  the  NEC  output, 
the  evaluation  function  may  then  proceed  to  determine  the  fitness  level  of  a  candidate  pattern.  In  the  following 
sections,  the  results  for  mutual  coupling  compensation  (MCC)  using  the  EMS-GA  [7]  are  elaborated. 

II.  Problem  Formulation 

For  an  arbitrary  array,  its  array  factor,  AF,  can  be  expressed  by 

AF  ■=  wTS(<l>,  <t>m)  (1) 
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where 


w  =  {wi,W2,-  ■  ■  ,WN}T,wn  €  Cn,  (2) 

is  the  weighting  vector,  S  is  the  steering  vector,  <(>  and  <j>m  are  the  direction  variable  and  main  beam  direction, 
respectively.  Cn  is  the  set  or  a  subset  of  all  complex  numbers. 

Take  for  example,  a  circular  array  of  N  identical  elements,  the  steering  vector  is 


S (<j>,  <j>m)  =  expfj'fcci  *  [C0S(<£  -  4>n)  -  COS (<j>m  -  «/>n)]}i  (3) 

where  a  is  the  radius  of  the  circular  array,  4>n  =  2im/N,  <f>m  is  the  main  beam  pointing  direction,  and  4>  is  the 
azimuth  angle,  provided  the  elevation  is  fixed  at  90°.  The  same  set  of  optimum  weights  for  the  main  beam  at 
a  particular  direction  can  be  used  for  other  directions,  if  the  above  S  vector  is  recalculated  for  the  new  beam 
pointing  direction.  Thereafter  any  near-solution  can  be  planted  into  the  initial  population,  and  the  EMS-GA  is 
applied  to  correct  the  sidelobe  level  ( SLL )  and  main  beam  shape  of  the  pattern,  while  takmg  mutual  coupling  into 
consideration. 


III.  The  Genetic  Algorithm 

Based  on  natural  evolution  [8],  genetic  algorithms  capitalize  on  tools  that  work  well  in  nature.  A  mathematical 
treatment  on  GA’s  foundations  is  provided  in  [9],  while  numerous  contemporary  GA  applications  can  be  found 
in  [10].  Living  beings  are  encoded  by  chromosomes,  one  may  encodes  the  possible  solutions  in  the  form  of  data 
structures  with  GAs. 


A.  Chromosome  Structure 

Most  GAs  use  binary  coding  and  binary  genetic  operations  [11].  The  proposed  approach  however  applies  floating¬ 
point  genetic  operations  on  complex  array  weighting  vectors.  Hence,  each  chromosome  is  a  vector  of  complex 
numbers  and  the  dimension  of  the  vector  is  equivalent  to  the  number  of  array  elements. 


B.  Initial  Population 

An  initial  population  of  at  least  100  random  chromosomes  is  generated.  The  constrained  adaptive  algorithm  in  [12] 
is  employed  to  produce  a  near-solution,  w ^  ,  with  the  elevation  angle  set  to  90°,  to  replace  the  weakest  individual 
among  the  initial  population.  Its  insertion  helps  to  improve  the  rate  of  convergence. 


C.  Reproduction 

Hank-based  fitness  assignment  sorts  the  individuals  in  a  descending  order  of  fitness  for  the  kth  generation,  Gk 
population  of  P  individuals.  Linear  crossover  is  performed,  where  two  parents  produce  two  children.  The  selection 
method  uses  Emperor-Selective  (EMS),  where  the  best  individual  in  Gk  gets  to  mate  with  every  other  even  sample 
in  the  population.  It  allows  the  fittest  individual  to  procreate  freely  with  practically  the  rest  of  the  population. 
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D.  Survival  Selection 


EMS  works  on  G*  to  produce  cPop ,  which  is  the  child  population  after  mating  and  crossover.  Concurrently,  a 
non-uniform  multi-modal  mutation  operator  is  applied  to  a  population  mPopl  which  comprises  of  X(X  <  P) 
copies  of  the  fittest  individual  prior  to  the  mating  operation  above.  The  Gaussian  distribution  parameter  5, 
which  is  inversely  proportional  to  the  size  of  mutational  changes  introduced,  is  adaptively  increased  once  stagnated 
growth  due  to  premature  convergence  is  detected.  The  same  mutation  operator  is  performed  across  a  copy  of  the 
original  population  Gk,  giving  mPop2 ,  the  mutated  child  population  based  on  Pm,  the  mutation  probability  which 
is  usually  greater  than  0.04. 

Unlike  [7],  only  the  best  Y(Y  <  P)  individuals  from  mPopl  and  mPop2  axe  selected  to  produce  tPop,  the 
population  of  best  individuals  produced  through  unary  transformation. 

Finally,  the  new  set  of  P  individuals  that  forms  the  next  generation,  Gfc+i ,  are  those  from  the  best  of  cPop  and 
tPop. 


E.  Fitness  Evaluation 


Fig.  1.  Template  cast  over  each  decoded  pattern  to  evaluate  its  fitness. 


A  template,  formed  by  the  shape  of  the  main  lobe  and  the  specified  SLL,  is  casted  over  the  array  pattern  produced 
by  each  candidate,  as  shown  in  Fig.  1,  to  compute  their  cumulative  difference  as  a  form  of  fitness  measure.  Thus, 
the  ideal  array  pattern  is  one  that  conforms  to  the  original  main  beam  shape  with  the  specified  SLL,  as  depicted 
in  Fig.  1. 


F.  Termination  Criteria 

The  maximum  number  of  generations  must  be  defined  together  with  the  desired  fitness  level.  By  satisfying  either 
one  of  the  above  criteria,  the  GA  will  terminate. 
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IV.  Simulation  Results 


A.  Circular  Array  of  16  Vertical  Dipoles 


X  0.4  -0.4 

Fig.  2.  A  circular  array  of  16  vertical  dipoles. 


For  ideal  array  pattern  synthesis,  the  solution  for  the  excitation  coefficients  can  be  obtained  by  the  constrained 
adaptive  algorithm  [12].  An  array  beam  obtained  is  shown  by  the  dashed  line  in  Fig.  3,  with  the  elevation  angle 


Azimuth  (degrees) 


Fig.  3.  Similarly,  the  original  pattern  (dashed  line)  becomes  deformed  (solid  line)  when  mutual  coupling  is  taken  into  consideration. 

Once  mutual  coupling  is  considered  after  running  the  NEC  simulation,  the  pattern  designed  with  ideal  weights  is 
severely  deformed  as  depicted  by  the  solid  line  in  Fig.  3.  In  this  example,  SLL  changed  from  the  original  —19.38 
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dB  to  -8.3  dB.  The  template  (dotted  line)  is  set  to  produce  a  corrected  pattern  of  -19  dB  SLL.  The  normalized 
excitation  coefficients  are  tabulated  in  Table  I  as  shown  below,  while  the  fitness  progress  is  plotted  in  Fig.  4. 


Fig.  4.  Fitness  progress  chart  obtained  after  averaging  the  results  from  30  runs. 


Azimuth  (degree) 

Fig.  5.  The  deformed  pattern  (dashed  line)  and  the  corrected  pattern  (solid  line). 

After  an  average  of  73  generations,  the  corrected  pattern  is  obtained.  Fig.  5  shows  the  deformed  pattern  (dashed 
line)  and  the  corrected  pattern  and  (solid  line).  The  SLL  is  recovered  to  -19  dB. 
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TABLE  I 

Normalised  circular  array  excitation  coefficients  for  compensated  power  pattern  by  the  GA 


Element  Number 

Original  Weights  Without 
Considering  Mutual  Coupling 

Compensated  Weights 
Considering  Mutual  Coupling 

1. 

0.01581695  +0.01154566j 

0.00210863  +0.01281295j 

2. 

0.01806623  +0.01385511j 

0.01026920  +0.00972533j 

3. 

0.01823458  -0.01058734] 

0.02178864  -0.00621535j 

4. 

0.00555620  +0.00417207j 

-0.00267502  -0.00622833j 

5. 

-0.01672706  -fO.OOOOOOOOj 

-0.01049456  +0.01226553] 

6. 

0.00555620  -0.00417207j 

-0.00628700  -0.01480830j 

7. 

0.01823458  +0.01058734j 

0.01766233  +0.01615022] 

8. 

0.01806623  -0.01385511] 

0.03292289  -0.00831566j 

9. 

0.01581695  -0.01154566j 

0.00850921  -0.02939043j 

10. 

0.01806623  -0.0138551 lj 

0.02530808  -0.00313775j 

11. 

0.01823458  +0.01058734j 

0.01971566  +0.00929316] 

12. 

0.00555620  -0.00417207j 

-0.00585220  -0.01205474j 

13. 

-0.01672706  +0.00000000j 

-0.00787380  +0.00133553] 

14. 

0.00555620  +0.00417207j 

-0.01053157+  0.00263694j 

15. 

0.01823458  -0.01058734j 

0.01740513  -0.00225400j 

16. 

0.01806623  +0.0138551  lj 

0.00983548  +0.00177143j 

V.  Conclusions 


The  proposed  mutual  coupling  compensation  approach  can  be  applied  to  any  other  array,  since 
its  operation  is  independent  of  the  array  configuration.  However  by  using  this  approach,  the 
algorithm  can  reduce  the  SLL,  and  at  the  same  time,  reshape  the  beam  to  any  desired  pattern 
by  simply  altering  the  shape  of  the  ideal  template. 
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Abstract 

A  three-objective  Pareto  genetic  algorithm  is  applied  to  the  design  of  practical  impedance 
modulated  feeds  for  log-periodic  monopole  arrays.  Specifically,  the  arrays  are  designed  to  have  a 
given  radiation  pattern  while  at  the  same  time  minimizing  both  the  feed  reflection  coefficient  and 
the  amount  of  energy  coupled  into  the  array  that  does  not  get  radiated.  The  use  of  a  Pareto  GA 
allows  for  a  study  of  the  tradeoffs  between  these  three  goals,  and  results  in  a  database  of  designs 
representing  optimal  combinations  among  them.  Numerical  results  illustrating  the  design  of  an 
array  mounted  on  an  airplane  wing  illustrate  the  optimization  process.  These  results  are  compared 
to  designs  resulting  from  more  traditional  optimization  methods  (a  simple  genetic  algorithm  and  the 
simplex  method)  and  are  shown  superior  in  all  respects. 

1  Introduction 

Since  their  invention  in  the  1960’s,  the  log-periodic  monopole  array  (LPMA)  has  become  a 
popular  antenna  because  it  is  easy  to  construct  and  has  nearly  frequency-independent  radiation 
characteristics.  LPMAs  are  usually  mounted  on  very  large  ground  planes  so  that  they  may  be 
designed  using  image  theory  and  log-periodic  dipole  array  (LPDA)  synthesis  techniques  [1,  2].  The 
assumption  of  a  large  ground  plane  thus  makes  standard  LPMA  design  methods  inapplicable  to  the 
design  of  arrays  mounted  on  more  complex  structures,  such  as  the  airplane  wing  model  shown  in 
Figure  1.  Mounting  an  LPMA  on  a  complex,  finite  platform  can  degrade  the  antenna’s  performance 
in  three  ways: 

•  The  array  pattern  can  be  altered  due  to  the  presence  of  the  platform. 

•  The  arbitrary  structure  of  the  platform  may  result  in  highly  reflective  feeds. 

•  The  truncation  of  the  array  may  lead  to  excessive  power  consumption  in  the  terminating 
load  of  the  feed. 

This  study  will  demonstrate  that  a  redesign  of  the  classical,  impedance  modulated  LPMA  feed  [3] 
suffices  to  ameliorate  these  problems.  This  redesign  is  accomplished  using  a  Pareto  genetic 
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algorithm  (PGA)  designed  to  contemporaneously  minimize  the  three  above-listed  deleterious  effects 
of& mounting  the  LPMA  on  a  finite  platform.  In  so  doing,  the  work  presented  here  illustrates  not  only 
the  power  of  the  PGA  applied  to  electromagnetic  design  problems,  but  also  that  the  design  of  an 
operating  LPMA  mounted  on  a  less  than  ideal  platform  can  be  accomplished  through  the  redesign 
of  the  feed  alone. 

Like  the  simple  genetic  algorithm  (SGA),  the  PGA  is  an  optimization  algorithm  based  on  the 
principles  of  natural  selection  discovered  by  Charles  Darwin.  Because  it  is  derived  from  the  SGA, 
the  PGA  retains  the  multifarious  appealing  properties  of  evolutionary  optimization  (e.g.  robustness 
in  avoidance  of  local  optima,  simplicity  of  coding,  ability  to  work  without  gradient  information, 
etc.).  Unlike  the  SGA,  however,  the  PGA  does  not  converge  to  a  single  (supposedly)  global 
optimum,  but  to  a  set  of  optima  representing  tradeoffs  between  the  various  design  goals.  Moreover, 
since  such  a  result  implies  a  more  thorough  search  of  the  design  space,  the  PGA  often  results  in 
better  designs  for  any  single  goal  than  an  SGA  provides  for  the  same  goal. 

2  Formulation 

In  this  section,  the  design  of  an  LPMA  feed  using  a  Pareto  GA  is  described.  Section  2.1 
discusses  the  optimization  problem  in  general  by  describing  the  feed  topology,  optimization 
parameters,  and  analysis  method.  Section  2.2  then  describes  Pareto  optimization  and  the  PGA. 

2.1  Log-Periodic  Monopole  Arrays 

Because  of  its  self-similar  nature,  a  standard  LPMA  (Figure  2)  consisting  of  N  monopoles 
can  be  described  fully  with  only  three  parameters.  If  the  length  of  the  nth  monopole  in  the  array  is 
denoted  by  l",  the  constant  parameter  r  describes  the  ratio  between  the  lengths  of  adjacent  elements 
as 


Similarly,  the  spacing  between  elements  n  and  n+1  is  related  to  l"  by  a  constant  parameter 

4a  =  ~f.  (2) 

Coupling  these  parameters  with  the  length  of  one  of  the  elements  is  sufficient  for  a  complete 
description  of  the  radiating  structure  of  the  LPMA.  Because  this  study  seeks  to  illustrate  that  no 
redesign  of  LPMA  itself  is  required  to  ensure  its  proper  operation,  the  geometric  progression  of 
element  lengths  and  interelement  spacing  is  given  by  equations  (1)  and  (2)  exactly. 

The  description  of  the  impedance-modulated  feed  (Figure  3)  is  considerably  more 
complicated.  The  feed  network  is  connected  to  a  source  with  impedance  ZSRC  near  the  smallest 
monopole,  and  is  terminated  in  a  load  impedance  ZL  near  the  largest  monopole.  Surrounding  each 
monopole  n  are  two  sections  of  transmission  line  of  impedance  ZnM  and  length  l?  referred  to  as 
modulated  lines.  Between  these  modulated  pieces  of  line  (and  before  and  after  the  first  and  last 
modulated  lines)  are  transmission  lines  of  impedance  Z0  and  length  .  (Because  there  are  N+ 1  of 
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these  non-modulated  lines,  Z°  is  the  length  of  line  before  monopole  n+1,  with  /°+1  denoting  the 
length  of  line  between  the  last  modulated  line  and  before  the  load  impedance.)  The  PGA  described 
in  the  next  section  will  design  the  feed  by  fixing  all  of  these  values.  Specifically,  the  impedance 
values  are  chosen  from  a  database  of  readily  available  components,  but  the  lengths  of  line  allowed 
are  allowed  to  vary  continuously. 

Like  all  optimization  algorithms,  the  PGA  requires  some  rating  of  the  worth  of  a  given 
design  to  operate.  Unlike  more  standard  algorithms,  however,  the  PGA  can  simultaneously 
optimize  many  several  functions  of  the  feed  parameters  to  determine  optimal  tradeoffs  between 
them.  In  the  case  of  LPMA  design,  the  minimization  of  three  quantities  is  sought:  a  weighted 
mean-square  difference  between  the  field  provided  by  a  given  feed  and  an  ideal  feed  (A),  the 
maximum  feed  reflection  coefficient  over  the  band  of  operation  (T),  and  the  percentage  of  energy 
coupled  into  antenna  that  is  dissipated  by  the  load  (E).  Thus,  Pareto  optimization  can  be  seen  as  a 
vector  optimization  criterion,  operating  on  the  “goal  vector”  [A,r,E] . 

Because  the  external  radiating  structure  (i.e.  the  monopoles  themselves  and  their  platform) 
does  not  change  during  the  course  of  the  optimization,  the  analysis  can  be  done  in  two  steps. 
Before  the  optimization  begins,  the  structure  is  analyzed  using  the  method  of  Moments  (MoM). 
Specifically,  for  several  test  frequencies  (Ok  (k  =  1,...,  Nf )  in  the  band  of  interest,  the  MoM  is  used 
to  extract  the  NxN  admittance  matrix  Y“'  of  the  N  element  array.  Furthermore,  the  far  field 
radiated  by  unit  excitation  of  monopole  n  in  the  direction  (fl.,0,.),  i  =  l,...,Nangle,  and 
frequency  k,  of  interest  can  be  stored  so  that  superposition  can  be  used  to  find  the  field  radiated  by 
the  element  excitations  established  by  an  arbitrary  feed. 

Given  this  information,  the  evaluation  of  the  feed  network  can  be  accomplished  very  simply. 
Using  standard  transmission-line  theory,  Ykeed ,  the  (iV  +  l)x()V  +  l)  admittance  matrix  of  the  feed 
describing  the  interaction  of  the  single  source  port  with  the  N  antenna  ports  can  be  extracted. 
Denoting  the  voltage  and  current  vectors  for  the  N  antenna  ports  by  V4fln'  and  1“' ,  and  the  current 
and  voltage  at  the  source  port  by  V/  and  I’ ,  the  current  and  voltage  at  each  of  the  ports  can  be 
related  by  the  equations 
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(3) 


as  the  feed  network  is  directly  connected  to  the  antennas.  (The  matrix  in  (3)  has  been  partitioned  to 
separate  out  the  source  interaction.)  These  partitioned  matrix  equations  may  be  solved  for  the  input 
admittance 

ykM =Y&d-  rfftrr + w 


and  the  voltage  at  each  antenna  terminal 
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vr=  =-(Yr+Y/r)"Y£.  (5) 

^k,N  _ 

The  field  strength  Fk(6n(p;)  computed  at  frequency  *  and  angle  (0^0,)  for  a  given  feed  is  then 
given  by 


Then,  A  is  defined  by 


F,(e„^)=2oiogEY,jF,l,(e„#1)  • 


A  =  5  ”2V.  (m.  )-  f“"  (»..*<  H  (»..♦< )■ 


where  F/”'(0,,<ft)  and  1^(0,-,^)  are  the  desired  far-field  amplitude  and  the  weight  assigned  to  the 
;th  angular  direction  at  the  k*1  frequency,  respectively.  The  weights  W*(0,  ,0;)  can  be  chosen  so  that 
the  resulting  values  of  A  range  between  0  and  1.  Additionally,  the  reflection  coefficient  magnitude 
and  inefficiency  at  frequency  k  are  given  by 

P  -  l~ZsxcYkJn  (g) 

*  -  i  +  z^y,.„ 


Re{(Y£f  )*[(Yr  +Yt?)-']'(Yr)‘(Yr  +  Yff  )"  Y^ 


where  the  asterisk  denotes  the  complex  conjugate  transpose  operation.  Finally,  the  feed  reflection 
coefficient  magnitude  and  feed  inefficiency  are  given  by 

T=  max  (r*) 

(10) 

E=  max  (EA 

***** 

Derivation  of  these  expressions  can  be  found  in  [4]. 


2.2  The  Pareto  Genetic  Algorithm 

In  the  previous  section,  three  minimization  goals  were  discussed  in  the  context  of  LPMA 
design.  The  PGA  is  an  algorithm  that  searches  for  all  optimal  combinations  of  these  goals.  Before 
describing  the  PGA,  therefore,  a  definition  of  optimality  with  respect  to  more  than  one  goal  is 
needed.  Such  a  definition  is  embodied  in  the  concept  of  Pareto  optimality,  which  is  described  next. 

Consider  two  LPMA  designs  a  and  b  with  goal  vectors  given  by  [A0,Ta,EJ  and 
[Afc,rfr,Efe],  respectively.  Design  a  is  said  to  dominate  design  b  if  Aa  <  Afc ,  Tu  <  Th ,  and  sua  <  ~b 
with  strict  inequality  holding  in  at  least  one  of  the  three  inequalities.  A  design  is  called  Pareto 
optimal  if  it  is  not  dominated  by  any  other  design  in  the  search  space,  and  the  set  of  all  such  designs 
is  called  the  Pareto  front. 
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Despite  their  multiobjective  optimization  abilities,  PGAs  are  very  similar  to  SGAs.  The  two 
algorithms  differ  only  in  the  implementation  of  the  selection  operator.  For  this  reason,  only  the 
selection  operator  is  described  in  detail  here. 

Like  the  SGA,  the  PGA  begins  with  a  random  initialization  of  a  population  of  Np 
chromosomes.  Each  chromosome  is  composed  of  a  string  of  bits  that  may  be  decoded  to  give  all  of 
the  design  parameters  described  in  the  previous  section  (see,  for  example,  [5]  for  the  decoding 
procedure).  Once  the  population  is  initialized,  the  goal  vector  of  each  population  is  evaluated  as 
described  above. 

After  the  population  is  evaluated,  the  selection  operator  is  applied.  The  Pareto  criterion 
poses  two  challenges  to  the  PGA:  To  return  the  entire  Pareto  front,  designs  must  be  compared  on 
the  basis  of  their  relative  Pareto  optimality,  and  at  the  same  time  the  population’s  tendency  to 
converge  to  a  single  design  must  be  counteracted  to  produce  an  algorithm  which  returns  a  complete, 
diverse  front. 


To  meet  the  first  challenge,  the  Pareto  ranking  scheme  of  Fonseca  and  Fleming  is  used. 
Each  population  member  is  assigned  a  rank  equal  to  the  number  of  designs  in  the  population  that 
dominate  it,  plus  one.  Thus,  all  nondominated  members  of  the  population  are  assigned  a  rank  of 
one,  with  the  rest  of  the  population  having  a  higher  rank. 


To  ensure  populational  diversity,  the  sharing  operator  is  employed.  Besides  a  Pareto  rank, 
each  member  i  of  the  population  is  assigned  a  niche  count  p,  which  measures  the  number  of  similar 
designs  surrounding  it.  Specifically,  the  distance  between  chromosomes  i  and  j  is  simply  the 
Euclidean  distance  between  their  goal  vectors  considered  as  position  vectors  in  3-space.  The  niche 
count  for  chromosome  i  is  then  calculated  as 

P,=Xs(d/)  ai) 

7=1 


where  the  sharing  function  S  is  defined  as 


S(d)  =  max 


1- 


(12) 


where  a  >  0  and  E  are  user-defined  parameters  that  determine  the  shape  of  the  sharing  function. 


After  niche  counts  and  Pareto  ranks  are  computed,  each  population  member  is  assigned  an 
objective  function  value  as  follows.  Members  of  the  first  rank  are  given  an  arbitrary,  identical, 
“pre-sharing”  fitness  value.  To  compute  the  fitness  of  a  certain  member  i  in  the  first  rank,  this  value 
is  then  divided  by  p,- .  The  members  of  the  second  rank  are  then  assigned  a  “pre-sharing”  value 
equal  to  the  lowest  “post-sharing”  value  of  the  first  rank  members.  The  process  then  continues  in 
this  fashion  until  all  members  have  been  assigned  a  value. 


Once  the  objective  function  values  of  the  entire  population  are  computed,  the  PGA 
continues  as  an  SGA.  Roulette-wheel  selection,  one-point  crossover,  and  the  usual  mutation 
operator  are  used.  The  process  is  terminated  after  the  Pareto  front  ceases  to  change. 
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3  Numerical  Results 

Using  the  PGA  technique  detailed  above,  an  LPMA  was  designed  for  the  wing  model  of 
Figure  1  with  a  band  of  operation  ranging  from  25  to  88  MHz.  The  LPMA  structure  consisted  of 
eight  monopoles,  with  t=  0.850,  <7=  0.096,  and  1‘  =  3.125  m.  An  ideal  pattern  was  supplied  in 
which  most  of  the  radiation  was  directed  outboard  from  the  wing  at  -30°  elevation.  The  remaining 
PGA  parameters  were  taken  as  NP  =  10,000 ,  (7  =  0.08 ,  E  =  1.75  and  Nf  =8  with  a  crossover  rate 
of  85%  and  a  mutation  rate  of  0.5%.  The  specific  frequencies  used  for  optimization  were  25.0, 
30.0,  35.0,  45.0,  55.0,  65.0,  75.0,  and  88.0  MHz.  The  PGA  ran  for  36  generations  before 
converging. 

In  addition  to  optimizing  the  feed  with  the  PGA,  both  the  SGA  and  simplex  methods  were 
also  used.  Because  neither  of  these  methods  is  specifically  tailored  to  Pareto  optimization,  both 
were  run  several  times  with  objective  functions  corresponding  to  different  linear  combinations  of 
the  three  goals.  To  ensure  a  fair  comparison,  both  the  SGA  and  the  simplex  method  were  allotted  in 
total  more  than  ten  times  the  36,000  function  evaluations  given  the  PGA.  The  Pareto  fronts 
generated  by  the  PGA  and  the  SGA  are  shown  in  Figure  4.  Notice  that  the  front  returned  by  the 
PGA  dominates  that  returned  by  the  SGA.  The  simplex  method  front  was  so  completely  dominated 
by  the  other  two  fronts  that  it  does  not  even  fit  on  the  same  graph. 

As  a  further  demonstration  of  the  superiority  of  the  PGA  method.  Figure  5  compares  the 
voltage  standing  wave  ratios  (VSWRs)  of  designs  selected  from  the  front  returned  by  each  of  the 
three  methods.  The  designs  compared  in  Figure  5  are  the  lowest  reflection  designs  from  the  fronts 
returned  by  the  SGA  and  the  simplex  method,  and  a  design  chosen  from  the  PGA  curve  which 
dominates  these  designs.  Figure  6  shows  the  patterns  of  the  three  designs  at  55  MHz.  Note  that  the 
PGA  design  has  the  lowest  backlobe.  More  data  will  be  presented  at  the  talk  to  further  compare  the 
three  methods. 

4  Conclusions 

A  PGA-based  method  for  the  design  of  feeding  networks  for  LPMAs  mounted  on  arbitrary 
bodies  was  outlined.  Given  the  LPMA  geometry  and  the  parameters  of  the  body  on  which  the 
LPMA  was  mounted,  the  presented  method  returns  a  good  approximation  to  the  Pareto  front 
consisting  of  optimal  tradeoffs  between  pattern  shape,  reflection  coefficient  and  feed  efficiency. 
The  algorithm  was  greatly  accelerated  by  the  precomputation  of  the  LPMA  admittance  matrix,  and 
the  field  radiated  by  each  monopole  excited  by  a  unit  voltage.  Results  of  the  application  of  the  PGA 
technique  were  compared  to  results  returned  by  an  SGA  and  the  simplex  method  that  demonstrated 
the  PGA  to  be  a  more  accurate  and  faster  method  of  Pareto  optimization. 
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Figure  1.  Log-periodic  monopole 
array  mounted  on  a  wing. 


Figure  2.  Standard  LPMA  mounted 
on  an  infinite  ground  plane, 
illustrating  geometric  parameters. 


Figure  3.  Impedance-modulated  feed  for  an  LPMA 


377 


Figure  4. 


Figure  5. 


Figure  6. 
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The  PGA  front  (shown  with  crosses  and  interpolated  surface)  and  the  dominated  iterated 
front(shown  with  circles). 
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Comparison  of  the  reflection  performance  of  three  feeds  designed  by  the  PGA,  the  SGA 


and  the  simplex  methods. 


Comparison  of  the  patterns  of  the  three  feeds  at  55  MHz.  The  simplex  feed  is  illustrated 
with  circles,  the  SGA  feed  with  triangles  and  the  PGA  feed  with  crosses. 
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Abstract 

In  this  paper,  a  novel  optimization  procedure  for  designing  Luneburg  lens  antennas  is  presented. 
One  of  the  important  engineering  objectives  of  designing  an  optimal  Luneburg  lens  antennas  is  to  use 
as  small  number  of  shells  as  possible  while  maintaining  an  acceptable  gain  and  sidelobe  performance. 
In  a  typical  uniform  design,  by  reducing  the  number  of  shells,  the  gain  is  decreased  and  the  grating 
lobes  are  increased.  This  deficiency  in  the  radiation  performance  of  the  uniform  lens  antenna  can  be 
overcomed  by  designing  the  non-uniform  lens  antenna.  This  necessitates  the  optimum  selection  of 
each  layers  thickness  and  permitivity.  A  Genetic  Algorithm  (GA)  optimizer  with  adaptive  cost 
function  is  implemented  to  obtain  the  optimal  design.  In  this  manner  the  GA  optimizer  simultaneously 
determines  the  optimal  material  and  its  thickness  for  each  shell  by  controlling  the  gain  and  sidelobe 
envelope  of  the  radiation  pattern.  The  results  have  been  satisfactory  and  demonstrate  the  utility  of  the 
GA/adaptive-cost-function  algorithm.  In  this  work,  various  lens  geometries  are  analyzed  by  using  the 
dyadic  Green's  function  of  the  multi-layered  dielectric  sphere. 

I.  Introduction 

For  beam  scanning  at  millimeter  wave  frequencies  in  mobile  and  satellite  communication  systems, 
there  has  been  some  interest  in  using  radially  symmetric  lenses  [1].  These  types  of  lenses  transform  the 
feed’s  spherical  radiation  pattern  to  a  plane  wave  [2].  Among  various  lenses,  the  Luneburg  lens  has 
received  much  attention.  Since  the  mathematical  design  principal  of  lens  antennas  is  based  on  the 
geometrical  optics  concept,  a  lens  antenna  can  typically  operate  in  a  broad  frequency  band.  On  the  other 
hand,  spherical  symmetry  of  the  lens  allows  for  multi-beam  scanning  usage  by  placing  an  array  of  feeds 
around  it.  While  both  radially  symmetric  lenses  and  phased  array  antennas  can  be  used  for  beam  scanning, 
the  latter  has  narrower  frequency  bandwidth. 

An  ideal  Luneburg  lens  consists  of  a  dielectric  sphere  with  varying  permitivity  ranging  from  2  at  the 
center  to  1  at  the  surface.  In  practice,  the  Luneburg  lens  antenna  is  constructed  as  a  multi-layered 
symmetrical  lens  that  focuses  the  feed  energy  efficiently.  However,  to  achieve  the  required  high 
performance  characteristics,  a  large  number  of  shells  may  be  required.  By  decreasing  the  number  of  shells, 
the  directivity  is  decreased  and  substantially  high  levels  of  unwanted  grating  lobes  are  generated.  This 
deficiency  in  the  radiation  performance  of  the  lens  can  be  improved  by  designing  a  non-uniform  lens  to 
overcome  the  grating  lobe  generations.  The  main  question  is  as  how  to  choose  the  optimum  width  and 
dielectric  constant  for  a  lens  with  reduced  number  of  shell  layers. 
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This  paper  focuses  on  the  optimum  synthesis  of  Luneburg  lens  antennas  using  the  genetic  algorithm 
(GA)  optimizer.  Since  it  is  desired  to  simultaneously  control  the  achievable  gain  with  reduced  grating  lobe 
level  structure,  an  adaptive  cost  function  is  used  in  the  process  of  GA  optimization.  To  determine  the 
scattered  field  of  the  lens  antenna,  an  analytic  technique  [3]  is  used  based  on  the  dyadic  Green’s  function  of 
the  multi-layered  spherical  shell.  The  main  features  of  this  paper  in  designing  the  non-uniform  lens  antenna 
are  summarized  in  the  following: 

>  Applying  the  construction  of  the  spherical  vector  wave  solution 

>  Integrating  the  genetic  algorithm  optimizer  with  the  vector  wave  solution  processes 

>  Defining  a  suitable  adaptive  cost  function  based  on  both  the  gain  and  sidelobe  envelop 

>  Presenting  representative  non-uniform  Luneburg  designs 

>  Discussing  some  of  the  engineering  aspects  and  features  of  the  results  obtained 

In  the  following  section,  the  formulation  of  the  underlying  electromagnetic  problem  is  presented.  In 
Section  HI,  some  unique  characteristics  of  the  uniform  Luneburg  lens  antenna  are  described.  In  Section  IV, 
a  non-uniform  lens  antenna  is  optimized  using  the  GA/adaptive-cost-function  technique  and  some  key 
aspects  of  this  implementation  are  discussed. 

II.  Computation  and  Optimization  Methodologies 
A.  Computation  Technique 

Solution  of  Maxwell's  equation  in  a  concentric  dielectric  sphere,  as  shown  in  Fig.  1,  is  determined 
by  using  the  dyadic  Green’s  function  of  a  multi-layered  dielectric  sphere  and  the  mode  matching 
technique. 


y 

Fig.  1.  Concentric  dielectric  sphere. 

The  total  electric  field  in  the  presence  of  dielectric  sphere  can  be  written  as 

E(r)  =  Ei(r)  +  E'1  (r)  =  Ge  (r,  r')  •  [-  jco^  i{r'j\dv  (1) 

where  G,  (r,r')  is  the  dyadic  Green's  function  responding  to  an  infinitesimal  dipole,  J  is  the  current 
density  of  the  source,  fi^  is  the  permitivity  of  region  1  containing  the  current  source,  and  E'  and  E* 
are  the  incident  and  scattered  fields,  respectively. 

The  scattered  electric  field  EJ  due  to  the  arbitrary  located  infinitesimal  dipole  Up ,  in  the  ith 
region  is  expressed  as 
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E(/>(r)  = 


(2) 


lie 2-Sm0) 

n=l  m=0 


(2n  +  l)(n-m)l 
n(n  +  l)(n  +  m)l 


+  {suN^(t,)+D,N^(t,)).(N:i(t1)-  p)j 


where  M  and  N  are  the  spherical  vector  wave  functions,  and  k;  and  k]  are  the  complex  propagation 
constants  in  regions  ith  and  1,  respectively.  The  unknown  coefficients  Ani,Bni,Cni,  and  Dni  are 
determined  by  applying  the  boundary  conditions  at  the  dielectric  interfaces.  Based  on  this  formulation 
and  by  using  an  accurate  computer  code  developed  in  [3],  the  electromagnetic  fields  in  all  regions  can 
be  determined. 


B.  Implementation  of  the  Genetic  Algorithm  with  an  Adaptive  Cost  Function 

The  genetic  algorithms  are  iterative  optimization  procedures  that  start  with  a  randomly  selected 
population  of  potential  solution,  and  gradually  evolve  toward  better  solution  through  the  application  of 
genetic  operators  that  are  selection,  crossover,  and  mutation.  GA  methods  are  global  optimizers  and 
have  some  unique  distinctions  with  respect  to  the  local  techniques  such  as  conjugate-gradient  and 
quasi-Newton  methods.  The  local  techniques  are  typically  highly  dependent  on  the  starting  point  or 
initial  guess,  while  global  methods  are  largely  independent  of  the  initial  starting  point.  Generally 
speaking,  local  techniques  tend  to  be  tightly  coupled  to  the  solution  domain,  resulting  in  relatively  fast 
convergence  to  a  local  maximum.  However,  this  tight  solution-space  coupling  also  places  some 
constraints  on  the  solution  domain,  such  as  differentiability  and/or  discontinuity  that  can  be  hard  or 
even  impossible  to  deal  with  in  practice.  The  global  techniques,  on  the  other  hand,  are  largely 
independent  of  the  solution  domain.  Consequently,  the  GA  method,  being  a  global  optimizer,  could  be 
an  efficient  technique  for  optimizing  the  new  electromagnetic  problems  having  discontinuities, 
constrained  parameters,  and  a  large  number  of  dimensions  with  many  local  optima  [4]. 


In  the  context  of  optimizing  the  radiation  performance  of  the  lens  antenna  with  m  spherical 
shells,  the  objective  or  fitness  function  F  is  defined  by 

where  en.  and  ti  represent  the  material  choice  for  each  shell  and  its  thickness,  respectively.  G(8)  is 
the  gain  pattern  of  the  lens  antenna  and  G0  is  its  maximum  value.  The  cost  function  F  attains  the 
maximum  gain  with  low  sidelobe  levels,  for  an  optimum  configuration.  By  a  proper  setting  of 
parameters  a,0,  and  the  sidelobe  envelop  function  /(0),  the  gain  and  sidelobe  levels  can  be 
controlled  effectively. 


In  order  to  apply  the  GA  method  to  the  lens  structure,  material  and  thickness  of  each  shell  are 
represented  in  a  binary  code  as 
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The  entire  structure  is  subsequently  represented  by  the  sequence  G  =  LlL2...Lm .  In  the  cases  studied  in 
this  paper,  N ^  -  Nlb  =  10  and  a  population  with  size  Npop  =  100  is  selected.  For  each  case,  EM  fields 
and  the  cost  function  F  are  computed  accurately.  In  the  process  of  our  GA  implementation,  crossover 
with  0.6  <  pCWS5  <0.9  and  mutation  with  0.01  <  pmu,  <0.1  are  used  [5]-[7].  Additionally,  in  this 
design  tournament  selection  and  elitism  are  also  employed. 

III.  Characteristics  of  Uniform  Luneburg  Lens  Antennas 

A  Luneburg  lens  antenna  transforms  the  point  source  radiation  into  the  plane  wave  and  vice  versa. 
An  ideal  Luneburg  lens  consists  of  a  spherically  symmetric  dielectric  sphere  with  continuous  varying 
permitivity  from  2  at  the  center  of  inner  core  to  the  1  at  the  outer  edge,  i.e.,  £r  ~  2- (r/af ,  where  a  is 
the  radius  of  the  sphere.  In  practice,  however,  the  Luneburg  lens  is  constructed  by  a  finite  number  of 
spherical  shells,  retaining  reasonable  performance.  Table.  1  shows  the  material  and  thickness  for  a  5- 
sheI130A  diameter  uniform  lens.  In  practice  feed  horns  or  open  ended  waveguides  are  used  to 
illuminate  the  lens.  In  this  paper,  in  order  to  model  the  radiation  pattern  of  a  typical  horn  antenna,  an 
end-fire  antenna  consisting  of  four  infinitesimal  dipoles  is  used  as  shown  in  Fig.  2. 


Table  1 .  5-shell  30A  diameter  Luneburg  lens 
antenna.  (Uniform  Design) 


30A 

Fig.  2: 5-shell  30A  diameter  Luneburg  lens  antenna. 

For  a  Luneburg  lens  antenna  with  fixed  diameter,  the  radiation  performance  is  improved  by 
increasing  the  number  of  shells,  but  the  performance  becomes  saturated  with  ~  lA  thickness  for  each 
shell.  Radiation  pattern  of  a  1 0-shell  30A  diameter  lens  is  shown  in  Fig.  3.  As  seen  the  gain  is  about 
37.73  dB.  Note  that  due  to  the  existence  of  the  periodic-like  uniform  shells,  there  are  some  grating 
lobes  in  the  pattern  occurring  around  9  ~  30° .  By  decreasing  the  number  of  shells  the  thickness  of 
each  shell  is  increased  and  so  the  grating  lobes  are  moved  closer  to  the  main  beam,  as  seen  for  a  5- 
shell  lens  (see  Fig.  3).  In  this  case  the  gain  of  the  lens  is  decreased  to  36.46  dB  and  the  grating  lobes  are 

occurred  closer  to  the  main  beam,  around  6  = 17° . 

From  these  plots,  it  is  observed  that  a  10-shell  30 A  lens  antenna  has  superior  performance.  From 
the  practical  point  of  view,  a  5-shell  lens  antenna  has  less  number  of  shells  and  so  its  construction  is 
easier.  However,  as  mentioned  earlier,  due  to  the  larger  thickness  of  each  shell,  the  grating  lobes  are 
occurred  nearer  to  the  main  beam.  The  radiation  performance  of  the  5-shell  uniform  Luneburg  lens 
antenna  can  be  improved  by  designing  a  non-uniform  lens  using  the  GA  optimization  method,  as 
described  in  the  next  section. 
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'A 
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3 
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1.74 
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1.90 
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Eg.  3:  Gain  pattern  of  the  30A  Lonebuig  lens  antenna  in  the  X—  Z  plane.  Note  the  appearance  of  the  unwanted  grating  lobes. 


IV.  Design  of  Non-Uniform  Lens 

This  section  focuses  on  the  key  design  aspects  of  a  non-uniform  Luneburg  lens  antenna  using  the 
genetic  algorithm  (GA).  The  goal  is  to  determine  the  permitivity  and  thickness  of  each  spherical  shell 
for  a  5-shell  30A  diameter  lens  antenna  such  that  the  maximum  gain  is  achieved.  However,  besides 
maximizing  the  gain,  the  level  of  sidelobes  should  remain  low.  The  problem  then  is  to  investigate  the 
tradeoffs  between  these  conflicting  objectives. 

In  this  study,  the  number  of  shells  is  m  =  5 ,  and  eri  is  allowed  to  change  between  1.0  to  2.0.  For 
achieving  the  maximum  directivity  the  parameters  a  and  ft  are  chosen  1  and  0,  respectively.  Fig.  4 
shows  a  typical  convergence  curve  for  the  GA  method  applying  it  to  a  5-shell  30/1  diameter  non- 
uniform  Luneburg  lens  antenna.  The  cost  function  F  is  the  antenna  gain  G0 .  As  seen  the  GA  method 
determines  the  optimum  material  and  thickness  for  each  spherical  shell  rapidly.  Table  2  shows  the 
optimum  material  and  thickness  for  the  lens  antenna.  Gain  of  the  lens  has  been  plotted  in  Fig.  5.  It  is 
observed  that  the  gain  is  increased  to  G0  =37 .47  dB  and  the  grating  lobe  is  decreased  about  6.5  dB 
with  respect  to  the  uniform  design.  However,  the  level  of  grating  lobe  is  noticeably  higher  than  the 
grating  lobe  level  in  a  10-shell  uniform  design.  This  grating  lobe  level  can  be  further  reduced  by  using 
the  parameter  ft  and  function  f(P).  As  noted,  the  10-shell 30A  lens  has  had  a  monotonically  decaying 
sidelobes.  Therefore,  the  function  /  is  taken  as  the  envelope  of  these  sidelobes  in  the  following  form 

f(6)  =  12  -  l%Log{e ’/5.8‘  )(dB) .  (5) 

In  this  case  a  useful  design  is  obtained  by  choosing  a  =  1,/J  =  0.5.  The  goal  is  achieving  the 
maximum  gain  while  the  level  of  sidelobes  is  also  low,  similar  to  a  10-shell  lens.  In  order  to  create  an 
adaptive  cost  function  in  the  process  of  the  GA  optimization,  in  the  first  20  generations  the  cost 
function  F  is  taken  as  the  antenna  gain  (i.e.,  (5  =  0).  Next  the  effect  of  both  gain  and  sidelobe  level 
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together  are  used  in  the  process  of  the  optimization.  This  helps  the  optimization  method  to  keep  the 
antenna  gain  high  while  decreasing  the  sidelobe  levels.  The  results  are  presented  in  Table  3  and  Fig.  6. 
Note  that  the  function  /  was  chosen  to  control  the  sidelobe  levels  in  the  angular  range  2.5  <6  <  25 
at  50  points.  Fig.  6  shows  that  the  gain  is  about  35.94 dB  and  the  grating  lobes  are  decreased  about 
12  dB  with  respect  to  the  uniform  design.  The  effectiveness  of  this  adaptive  optimization  technique  is 
shown  in  Fig.  7  by  varying  a  and  j8  in  Eq.  (3).  It  is  observed  that  simultaneous  application  of  a  and 
(3  in  the  cost  function  provides  the  best  result. 


Fig.  4:  Convergence  curve  of  the  GA  method  applied  to  the  5-shell  3<U  diameter  non-uniform  lens.  ( F  -  G0  ) 


Table  2.  Characteristics  of  a  5-shell  30A  diameter 

non-uniform  Luneburg  lens  antenna.  (  F  =  Go ) 
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Fig.  5:  Gain  pattern  of  the  5-shell  30A  diameter  non-uniform  Luneburg 
lens  antenna  in  the  X~  Z  plane. 
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Table  3.  Characteristics  of  a  5-sheII  30A  diameter 
non-uniform  Luneburg  lens  antenna. 


(F  =  Gb+O.5Mm[/(e)-G(04^^B) 


Shell 

er 

'A 
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1.075 

1.819 
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1.191 

1.232 
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1.725 

3.710 
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1.815 

1.615 

5 

1.948 

6.624 

e  (degrees) 


Fig.  6:  Comparison  between  10-shell  and  S-shell  uniform  design  with  the 
5-shell  non-uniform  optimi2ed  lens  in  the  x  —  Z  plane. 


( F  =  Go+0.5  Mn[m-GmUhb'^on  > 


9  (degrees) 

Fig.  7:  Radiation  performance  of  the  5-shell  non-uniform  optimized  lens  using  various  (X  and  /?  in  Eq.  3. 
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VI.  Conclusions 


In  this  paper,  non-uniform  Luneburg  lens  antennas  are  designed  using  the  genetic  algorithm 
optimization  method.  The  multi-layered  lens  antenna  is  analyzed  accurately  using  the  dyadic  Green's 
function  and  the  mode  matching  technique.  Since  the  genetic  algorithm  produces  the  global  solution 
without  requiring  much  information  about  the  solution  domain,  it  has  been  found  to  be  a  very  efficient 
method  for  obtaining  the  optimum  material  and  thickness  for  each  spherical  shell  of  the  lens  antenna.  The 
radiation  performance  of  the  non-uniform  lens  was  improved  with  respect  to  the  uniform  design 
efficiently.  By  choosing  an  appropriate  cost  function,  the  gain  of  the  5-shell  30A  lens  was  increased 
about  \dB ,  and  also  its  grating  lobes  decreased  6.5  dB  with  respect  to  the  uniform  design.  By 
optimizing  the  lens  for  both  directivity  and  sidelobe  level,  the  grating  lobes  decreased  about  12  dB 
with  respect  to  the  uniform  design. 

Although  this  paper  described  an  optimization  over  a  continuous  parameter  set,  GA  method  is  capable 
of  optimizing  the  discrete  structural  parameters.  This  means  that  by  using  the  GA  method,  the  best  material 
and  thickness  for  the  lens  can  be  picked  up  among  a  finite  set  of  materials  in  a  database.  By  choosing  the 
appropriate  cost  function,  one  can  optimize  the  lens  antenna  based  on  the  nature  of  the  desired  application. 
In  addition,  a  Pareto  GA's  [8]  may  be  used  to  further  refine  the  design  by  creating  a  set  of  optimized 
solutions.  In  summary,  the  methodologies  developed  in  this  paper  will  allow  efficient  design  of  multi¬ 
layered  lens  antennas. 
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Abstract 

When  using  the  Method  of  Moments  (MoM)  to  solve  scattering  problems,  one  of  the 
crucial  factors  is  the  appropriate  choice  of  basis  functions.  A  judicious  choice  of  the  basis 
functions  can  lead  to  a  sparse  impedance  matrix,  or  to  a  sparse  solution  vector,  both  of  which 
will  reduce  the  computational  burden.  However,  pulse  basis  functions  are  usually  used,  both 
due  to  their  simple  description,  and  because  there  are  already  many  software  tools  written  to 
this  aim.  Therefore,  some  efforts  had  been  recently  invested  in  transforming  a  formulation 
that  is  the  result  of  using  pulse  basis  functions,  to  one  using  preferred  basis  functions.  This 
transform  produces  grouping  of  the  pulse  functions  into  a  new,  more  appropriate,  set  of 
basis  functions.  This  paper  discusses  the  search  for  an  optimal  grouping,  in  terms  of  the 
sparseness  of  the  resulting  solution  vector.  Various  options  are  demonstrated  and  discussed, 
and  numerical  examples  are  given. 


1  Introduction 

The  Method  of  Moments  (MoM)  is  widely  used  for  the  solution  of  scattering  problems,  but  one  of 
its  greatest  limitations  is  the  resulting  large  and  dense  impedance  matrix.  This  imposes  constraints 
both  in  terms  of  storage  and  solution-complexity  of  the  impedance  matrix.  Most  of  the  methods 
to  relieve  these  constraints  are  concerned  with  rendering  the  matrix  sparse,  or  otherwise  exploit 
its  structure. 

A  common  approach  to  reduce  solution  complexity  is  to  divide  the  matrix  into  regions  (sub¬ 
matrices)  which  describe  close  interactions  (sources  and  testing  points  are  near),  and  far  inter¬ 
actions  (sources  and  test-points  are  far  apart)  [1,  2,  3,  4].  A  different  approach  is  that  of  using 
special  basis  functions  such  that  the  resulting  impedance  matrix  will  have  only  small  number  of 
dominant  terms.  Thus,  thresholding  the  matrix  will  lead  to  a  sparse  matrix,  with  yet  almost 
no  degradation  in  the  result.  An  example  is  the  use  of  wavelets  as  basis  functions  [5].  In  the 
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general  category  of  selecting  special  basis  functions  also  falls  the  Impedance  Matrix  Localization 
(IML)  method  of  Canning  [6.  7].  In  this  method  one  performs  a  basis-transformation  in  order  to 
transform  the  impedance  matrix  into  one  with  only  a  few  dominant  terms,  and  then  performs  a 
threshold  operation  to  arrive  at  a  sparse  matrix. 

A  different  point  of  view  was  recently  suggested  in  the  Impedance  Matrix  Compression  (IMC) 
method  [8.  9].  In  the  IMC  one  seeks  to  use  basis  functions  such  that  the  resulting  solution  vector  is 
sparse.  This,  in  turn,  enables  the  use  of  a  much  smaller  (in  dimensions)  impedance  matrix,  which 
is  much  easier  to  solve  then  the  original  impedance  matrix.  This  is  to  be  contrasted  with  the 
previously  mentioned  methods  which  sought  a  sparse  matrix.  The  matrix  and  vectors  used  in  the 
IMC  are  transformations  of  their  counterparts  for  the  existing  pulse-basis  function  formulation. 

In  this  paper  we  will  look  into  what  kind  of  basis  functions  should  be  used  for  the  IMC,  in  order 
to  have  the  most  possible  sparse  solution  vector.  We  will  start  by  formulating  more  precisely  the 
problem  at  hand,  and  will  then  proceed  to  description  of  possible  solutions.  Numerical  examples 
will  then  be  presented,  followed  by  summary  and  conclusions. 


2  Problem  formulation 

To  make  the  following  discussion  more  lucid,  let  us  limit  ourselves  to  a  simple  scattering  scenario, 
as  described  in  Fig.  1(a).  The  figure  describes  an  incident  EM  wave  with  known  components 


Hinc 


x 


(b) 


Figure  1:  (a)  General  problem  scenario,  (b)  Plate  scenario. 


(Einc,H,nc).  (In  the  following,  bold-face  font  (E)  will  denote  spatial  vectors,  namely  vectors  with 
three  coordinates  corresponding  to  some  Cartesian  system,  whereas  the  usual  notation  (I )  will 
be  used  to  denote  any  other  vector).  The  incident  wave  is  a  TMZ  (Transverse  Magnetic  to  z)  EM 
wave,  for  which  we  can  write  Einc  =  E fcz,  and  Hinc  =  Tlfcx  +  H fcy.  The  scatterer  is  an  infinite 
cylinder  lying  with  its  axis  along  the  z  axis,  and  it  is  assumed  to  be  a  perfect  electric  conductor, 
with  a  known  (yet  arbitrary)  cross  section.  The  perimeter-line  of  the  cylinder  is  denoted  by  £,  and 
the  length  parameter  along  this  line  is  denoted  by  l  6  [0,  /maT] .  On  the  perimeter  of  the  scatterer 
there  exists  an  induced  electric  current  J  =  Jzz,  which  is  the  unknown  to  be  solved  for.  As  can 
be  seen  in  the  above  formulation,  both  the  orientation  of  the  impinging  wave,  and  the  specific 
cross-section  of  the  scatterer,  were  left  unspecified.  The  set  of  scattering  problems,  obtained  by 
specifying  these  quantities,  will  be  denoted  as  S. 

The  scattering  problem  is  then  reduced  to  a  matrix  equation,  using  the  MoM.  In  what  follows, 
we  will  assume  that  pulse  basis  functions  are  used  in  conjunction  with  the  Galerkin  method,  where 
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the  pulse  basis  functions  are  defined  as: 


ft(0  = 


1  if  Z  €  Rk 
0  otherwise 


Rk  = 


[*zli 

1  M 


k  —  1, 2,  •  ■  ■ ,  M 


(1) 


The  resulting  matrix  equation  is 

[Z]I=V  ,  (2) 

where  [Z]  is  the  impedance  matrix,  I  is  the  vector  of  unknown  coefficients,  and  V  is  the  excitation 
vector.  In  line  with  the  idea  of  basis  transformation,  or  grouping,  we  introduce  the  transformation 
matrix  [W],  which  is  real  and  unitary,  and  the  following  notations: 


\Z\  —  [£][PF]T  ,  (where  [W\T  stands  for  the  transpose  of  [W]j 

i  =  [w]i  . 


(3) 

(4) 


Writing  the  impedance  matrix  equation  in  terms  of  the  transformed  quantities,  we  get 


[z]i  =  v  . 


It  is  important  to  note  that  the  two  terms,  "transformation-matrix”  and  ”grouping-of  basis  func¬ 
tions”,  are  synonymous.  The  latter  however  is  a  more  explicit  way  of  describing  the  meaning 
of  the  transformation  matrix,  as  applied  to  the  electromagnetic  formulation.  We  will  later  see 
that  this  understanding  of  the  electromagnetic  characteristic  features  embedded  in  the  problem, 
enables  us  to  come  with  a  feasible  solutions. 

Thus,  following  the  above  notations  and  explanations,  our  goal  can  be  stated  as  follows: 

Find  a  set  of  basis  functions  [W]  which  will  enable  a  sparse  representation  of  the  currents 

encountered  in  the  set  of  scattering  problems  S. 

However,  an  immediate  problem  is  the  fact  we  do  not  know  ahead  what  the  currents  are.  One 
possible  remedy  is  to  assume  a  certain  probability  distribution  over  the  possible  currents,  and  try 
and  choose  the  best  basis-functions  accordingly.  This  option  requires  decision  over  a  model  for 
the  current.  Another  option,  which  does  not  explicitly  assumes  a  certain  model  of  the  currents, 
uses  a  large  sample  of  ’typical  currents’,  which  hopefully  will  serve  as  a  good  representatives  of  the 
currents  one  will  encounter  in  practical  problems.  This  option  will  be  used  here.  The  difficulty- 
using  this  method  is  the  question  of  how  to  choose  these  ’typical  currents’.  This  is  exactly  the 
point  where  our  physical  understanding  of  the  problem  at  hand  gives  us  an  edge  over  some  abstract 
formulations,  and  it  will  be  addressed  later  on  in  Sec.  4.  At  this  stage  we  will  assume  we  have  a 
set  of  N  ( M  <  N )  such  ’typical  vectors’, { and  we  order  them  columnwise  in  a  matrix  form 


to  get  a  matrix  [J)m*n- 

Our  goal  can  now  be  stated  as  follows: 

I  Find  a  set  of  basis-functions,  described  by  a  real  unitary  matrix  [W],  such  that 


[Wn  =  argmm 


E  Spar([»y]^) 

Li=l:W 


(6) 


where  Spar  (if)  is  a  sparseness-measuring  function  which  evaluates  the  sparseness  of  x  (the  j 
lower  the  value  is,  the  more  sparse  the  vector). 
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The  sparseness-measuring  function  is  of  course  crucial  to  the  determination  of  [W  ]  and  the 
whole  optimization  process.  A  few  possible  choices  for  the  sparseness  function  are  indicated  in 
[10].  We  will  use  a  function  such  that  the  function  value  is  the  number  of  elements  in  x  which 
convey  90%  of  its  energy  (the  l2  norm  of  x).  As  we  will  see,  this  is  a  practical  measure,  but  not 
that  comfortable  analytically. 

In  the  next  section  we  will  describe  various  options  for  selecting  [W],  and  discuss  their  merits. 


3  Various  possible  groupings 

Rather  than  performing  the  optimization  in  (6)  directly,  we  will,  in  this  paper,  resort  to  examining 
a  few  of  the  possible  solutions.  This  will  give  us  a  valuable  insight  into  the  possible  solution. 


3.1  Karhunen-Loeve  (KL) 

The  KL  transform  is  actually  a  Singular  Value  Decomposition  (SVD)  analysis,  when  applied  to 
probabilistic  sources  [ll].  Given  a  matrix  [J]mxN  (where  M  <  N ),  let  us  denote  its  SVD  by 
the  triplet  [£/]MxN,  [S]nxk,  [VJjvxN,  where  [ U ]  and  [V]  are  unitary,  and  [S]  is  diagonal.  We  will 
further  assume  that  the  (M  non-zero  )  singular-values  are  arranged  in  a  descending  order  on  the 
diagonal  of  [5].  Thus,  we  can  write 

[J\  =  [l/][S][Vf  .  (7) 


It  is  also  possible  to  express  the  matrix  [J]  as  a  weighted  sum  of  the  columns  of  [U]  and  rows  of 

[VI7-  namely  ,  , 

(8) 


[j]«=  £  stMain* 

k=l:M 


Thus,  one  can  see  that  according  to  the  last  equation,  if  we  were  to  choose  the  best  Q  vectors 
which,  on  the  average,  will  yield  the  best  l2  approximation  of  the  currents  represented  in  [J\.  then 
these  vectors  are  the  first  Q  column- vectors  of  [£/].  However,  this  was  not  our  goal  as  stated  in 
(6).  Though  this  might  be  viewed  as  some  kind  of  sparseness  requirement,  it  over  looks  the  fact 
that  in  our  formulation  of  the  goal,  the  Q  vectors  chosen  to  describe  best  a  signal  could  be  drawn 
from  a  pool  of  M  vectors,  and  thus  they  do  not  have  to  be  the  same  Q  vectors  for  all  the  set  of 
N  vectors. 


3.2  Wavelets 

Wavelets  are  a  set  of  basis  functions,  which  have  some  very  useful  properties  for  the  sparse 
representation  of  natural  signals,  and  have  also  been  used  for  the  description  of  currents  [5,  12]. 
They  are  localized  both  in  terms  of  their  spatial-support  area,  and  in  terms  of  their  spatial- 
frequency  content.  There  are  also  fast  transform  methods,  which  are  based  on  the  hierarchical 
structure  of  the  basis  functions.  For  the  above  reasons,  we  choose  to  examine  their  suitability  to 
the  problem  at  hand.  Various  types  of  wavelets  will  be  used  (Haar,  Daubechies,  and  Coifmans) 
in  the  sequel. 

Wavepackets  are  variants  of  the  wavelets,  and  when  taken  to  the  extreme,  they  can  be  viewed 
as  Fourier  basis  functions  on  an  interval.  Namely,  their  spatial-support  is  very  wide  (relative  to 
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wavelets),  while  their  spatial-frequency  support  is  narrow.  We  will  not  probe  into  their  specific 
usage  here,  because,  as  will  turn  out,  the  KL  basis-functions  which  results  in  our  case  are  very 
similar  to  the  wavepackets. 

3.3  Optimal  solution 

A  much  more  comprehensive  optimization  technique,  which  will  be  described  in  the  talk  and  in  a 
forthcoming  paper,  goes  along  the  lines  of  [13].  To  this  aim,  one  defines  a  smoother  cost  function 
(than  (6)),  and  then  uses  its  analytic  derivative  to  enable  a  gradient-descent  type  of  optimization 
algorithm. 


4  Numerical  example 

The  first  issue  to  address  is  the  choice  of  the  ’typical  currents’,  namely  the  building  of  the  matrix 
[J).  As  we  mentioned  earlier  in  Sec.  2,  we  are  dealing  with  a  set  of  scattering  problems  S. 
For  our  purpose,  we  choose  as  a  typical  scatterer  a  plate,  as  described  in  Fig.  1(b).  Note  that 
this  scatterer  contains  the  two  extreme  cases  of  curvature:  an  edge  (curvature^  0)  and  a  straight 
segment  (curvature=  oo).  The  resulting  currents  were  computed  for  31  various  angles  of  incidence 
(0  :  3  :  90  degrees).  To  keep  the  description  in  what  follows  simple,  we  took  only  the  first  and  last 
N  =  16  samples  of  each  of  the  resulting  currents  in  this  case  (doubling  the  number  of  vectors  to 
62).  Furthermore,  we  split  them  to  real  and  imaginary  parts  (another  double),  and  included  all 
circular  rotations  (16  possible),  which  amounts  to  a  total  of  1984  (—  31  *  2  *  2  *  16)  representing 
functions.  Four  of  the  resulting  columns  of  the  matrix  [J]  are  depicted  for  example  in  Fig.  2. 

The  SVD  decomposition  of  the  this  matrix  was  then  computed,  and  the  16  eigen- vectors,  as 
described  by  the  columns  of  [17],  are  depicted  in  Fig.  3.  Note  that  these  are  in  essence  (apart  from 
numerical  errors)  cosine/sine  basis  functions.  The  reason  for  that  is  that  the  SVD  of  a  circulant 
matrix  yields  cosine/sine  basis  functions,  and  in  the  above  case  [J]  is  a  concatenation  of  circulant 
matrices. 

The  performance  of  the  conventional  pulse  basis,  the  basis  which  resulted  from  the  SVD 
analysis,  and  the  wavelet  basis,  are  described  in  Fig.  4.  In  the  plot  only  the  results  for  the  first  64 
vectors  of  [J]  are  described,  rather  then  the  whole  1984  vectors,  just  in  order  to  make  it  readable. 
In  this  figure  the  number  of  basis  functions  needed  to  describe  each  of  the  currents  adequately 
(to  within  90%  of  its  energy)  is  described.  We  see  that  the  conventional  pulse  basis  functions 
performs  the  poorest,  then  comes  the  KL,  and  much  better  performs  the  wavelet  basis  functions. 
It  is  interesting  to  note  that  the  use  of  various  types  of  wavelets  (Haar,  Daubechie,  Coifman) 
did  have  a  significant  impact  on  the  results.  For  example,  let  us  compare  the  avarage  number  of 
coefficients  needed  to  describe  the  vectors  in  [J]  well  enough:  For  the  case  of  pulse  basis  functions, 
an  average  of  11.7  coefficients  were  needed  (out  of  a  total  of  164  coefficients);  using  the  SVD  basis 
functions  resulted  an  average  of  10.96;  using  the  Haar  wavelets  led  to  an  average  of  9.5,  whereas 
using  the  Daubechie-3  wavelets  led  to  an  average  of  7.95.  Other  wavelets  were  tried  out,  but  the 
results  were  similar  to  that  of  the  Db3  wavelets. 

In  Figs.  5-6  the  application  of  the  above  bases  to  two  scattering  problems  are  described:  One  is 
scattering  from  a  circular  cylinder,  of  radius  1A,  and  the  other  is  scattering  from  a  square  cylinder, 
with  side-length  of  1.5 A.  The  scattering  problems  were  solved  for  31  different  angles  of  incidence 
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(0  :  3  :  90).  In  both  cases,  the  wavelet  basis  functions  show  better  performance  then  the  KL  basis 
and  the  pulse  basis  functions. 

5  Summary  and  Conclusions 

This  paper  presented  a  methodology  for  finding  the  best  basis  functions  to  be  used  in  a  set  of 
scattering  problems.  The  basic  idea  is  to  use  ’typical  currents’  to  the  problem  at  hand,  and  then 
find  the  appropriate  basis  functions  for  these  ’typical  currents’.  It  was  shown  that  a  conventional 
KL  (SVD)  approach  is  not  appropriate,  and  that  wavelets  performs  much  better.  An  optimization 
problem  was  formulated,  the  solution  of  which  will  yield  the  best  basis.  The  solution  of  this 
optimization  problem  is  now  under  way,  and  the  application  of  these  methods  to  three-dimensional 
problems,  where  the  initial  basis  functions  are  either  triangular  patches  or  square  patches,  will  be 
done. 
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Figure  2:  Four  of  the  resulting  columns  of  the  matrix 
[J].  (a-b)  The  real  and  imaginary  parts  of  the  current 
at  an  angle  of  45  deg.  (c-d)  Two  circulant  rotations  of 
the  currents  in  (a). 


Figure  4:  Sparseness  of  the  columns  of  [J]  in  the  three 
case:  No  basis  transformation  applied  (dotted  line)  ;  ba¬ 
sis  transformation  which  resulted  from  the  SVD  analysis 
(solid  line);  and  wavelet  (Daubeschie-3)  basis  transfor¬ 
mation  (dashed  line). 
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Figure  3:  The  16  eigen-vectors,  as  described  by  the 
columns  of  [C7],  In  each  subplot  the  appropriate  singular- 
value  is  given. 


Figure  5:  Scattering  from  a  circular  cylinder.  The 
performance  in  the  three  case:  No  basis  transformation 
applied  (dotted  line)  -  average  cost  of  32.0  (out  of  64 
coefficients)  ;  basis  transformation  which  resulted  from 
the  SVD  analysis  (solid  line)  (Avg.  31.3);  and  wavelet 
(Daubeschie3)  basis  transformation  (dashed  line)  (Avg. 
25.2). 


Figure  6:  Scattering  from  a  square  cylinder.  The  per¬ 
formance  in  the  three  case:  No  basis  transformation 
applied  (dotted  line)  -  average  cost  of  32.7  (out  of  64 
coefficients);  basis  transformation  which  resulted  from 
the  SVD  analysis  (solid  line)  (Avg.  38.9);  and  wavelet 
(Daubeschie-3)  basis  transformation  (dashed  line)  (Avg. 
24.5). 
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ABSTRACT:  GAs  provides  a  high  level  of  robustness  by  simulating  nature’s  capability  of  adapting  to  many 

different  environments.  Through  the  application  of  GAs  to  design  optimization  problems,  the  performance 
characteristics  of  the  GAs  are  shown  to  be  powerful  in  solving  optimization  problems  with  high  dimensional 
objective  functions  containing  several  local  minima.  As  they  use  only  the  fitness  values,  GAs  does  not  require 
derivatives  or  any  other  additional  information  about  the  objective  function.  The  performance  of  GAs  can  be 
improved  significantly  by  applying  selective  pressure  and  adequately  providing  a  shape  modeling  technique  to 
control  the  range  of  variation  for  the  shape  to  be  optimized. 


1.  INTRODUCTION 

The  need  to  solve  electromagnetic  problems  accurately  and  efficiently  for  large-scale  complex  problems  has 
caused  the  emergence  of  computational  electromagnetics.  The  methods  developed,  such  as  the  finite  difference 
method,  the  method  of  moments,  the  finite  element  method,  and  the  boundary  element  method,  have  become 
increasingly  sophisticated  with  ever  increasing  computational  power.  These  numerical  techniques  are  used  by 
researchers  as  indispensable  tools  to  compute  the  field  parameters  and  predict  the  performance  of  devices  in  all  areas 
of  electromagnetics.  These  applications  range  from  two-dimensional  electrostatics  to  nonlinear  eddy  current 
calculations  in  three  dimensions  and  to  high  frequency  applications.  The  tremendous  advancements  in  computer 
technology  make  it  possible  to  enhance  these  analysis  tools  with  state  of  the  art  modeling  and  post-processing 
capabilities  to  offer  the  researcher  or  the  engineer  everything  that  is  necessary  to  analyze  an  electromagnetic  problem 

in  a  user-friendly  package.  .  ,  . 

Many  researchers  and  engineers,  however,  seldom  need  to  perform  analyses  alone.  Their  major  task  is  to 
come  up  with  the  best  solution  for  a  problem.  In  other  words,  engineers  need  a  tool  that  can  help  them  synthesize  the 
best  solution  for  the  problem  in  their  hands.  The  place  of  an  analysis  package  in  this  task  description  is  to  provide  the 
solution  and  the  performance  measure  of  the  current  design,  which  help  the  researcher  navigate  in  search  of  the 
optimum.  With  the  definition  of  this  new  need,  computational  electromagnetics  adopts  another  field,  design 
optimization.  Using  the  already  perfected  analysis  modules  in  coordination  with  a  search  method,  design  optimization 
in  computational  electromagnetics  seeks  the  optimum  solution  for  a  specified  problem.  This  problem  has  an  objective 
function  and  constraints.  Design  optimization's  task  is  to  maximize  (minimize)  the  objective  function  while  not 
violating  the  constraints.  The  objective  function  f  (xj),  is  a  function  of  the  design  variables,  X;.  The  number  of  design 
variables  defines  the  dimension  of  the  search  space.  There  is  a  set  of  constraints  that  govern  the  domain  of  the  design 
variables.  Each  constraint  can  be  written  as  an  inequality  pj  (Xj)  <  0.  The  optimization  is  formulated  as: 

Maximize  f  (x;)  (objective  function) 

Subject  to  pj  (Xj)  <  0  (constraints) 

Problems  dealing  with  the  optimization  of  parameters  are  seen  in  every  field.  Mathematical  optimization  can 
be  dated  back  as  early  as  the  mid  1700s  when  Euler  developed  the  calculus  of  variations.  Many  other  mathematical 
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optimization  techniques  are  derived  from  the  calculus  of  variations.  In  the  last  20-30  years,  progress  has  been  made  in 
solving  inverse  problems  largely  due  to  the  mathematical  regularization  theory  developed  by  Soviet  scientists  [1]. 

In  electromagnetics,  numerical  techniques  combined  with  the  available  computing  capability  raise  the 
opportunity  for  incorporating  the  analysis  methods  with  a  search  scheme  to  locate  the  optimum  in  the  search  space. 
This  very  promising  environment  results  in  the  addition  of  new  techniques  to  the  arsenal  of  numerical  optimization 
methods.  Stochastic  search  techniques,  such  as  simulated  annealing,  evolution  strategies,  and  genetic  algorithms, 
along  with  artificial  intelligence  based  methods,  such  as  neural  networks,  have  been  introduced  in  the  last  ten  years. 
Optimization  schemes  that  combine  one  or  a  hybrid  of  several  of  these  methods  with  computational  electromagnetics' 
analysis  tools  are  reported  to  have  successful  applications  to  design  optimization  problems  of  various  kinds.  Genetic 
algorithms  (GAs),  developed  by  John  Holland  in  the  1 970s,  attracted  the  interest  of  more  and  more  researchers  due  to 
their  robustness  and  efficiency  in  handling  search  problems.  In  computational  electromagnetics,  design  optimization 
can  benefit  from  the  use  of  this  highly  successful  method.  The  finite  element  method  joined  with  GAs  to  form  a 
design  optimization  environment  for  electromagnetic  devices. 

One  of  the  most  difficult  aspects  of  design  optimization  is  the  shape  optimization  problem.  The  optimization 
technique  described  here  is  not  limited  to  shape  optimization.  Other  design  parameters,  such  as  excitation  and 
material  characteristics  as  well  as  device  terminal  parameters  may  also  be  a  part  of  the  optimization's  goal.  Thus,  a 
multidimensional  search  space  is  under  consideration.  The  optimization  method  must  be  able  to  perform  an  effective 
search  in  a  multidimensional  parameter  space.  As  a  shape  optimization  problem  may  require  a  high  number  of  control 
points  to  be  adjusted,  the  number  of  design  parameters  can  be  quite  large.  This  should  not  pose  a  major  difficulty  for 
the  optimization  scheme.  In  other  words,  the  developed  scheme  must  possess  the  flexibility  to  adapt  easily  to  different 
design  needs. 

The  multiparameter  objective  function  for  a  simple  design  optimization  problem  in  magnetostatics  can  be 
non-convex  (local  extremes),  non-differentiable,  and  stiff  [2-3].  The  optimization  algorithm  must  be  able  to  deal  with 
these  difficulties.  It  is  known  that  a  search  method  based  on  deterministic  methods  tends  to  fail  when  the  objective 
function  is  discontinuous.  The  local  extremes  also  present  a  significant  danger  to  the  success  of  the  search  because  the 
gradient  based  algorithm  can  be  easily  trapped  in  such  a  local  extrema.  Finally,  the  search  process  is  rated  by  its 
speed  of  finding  a  solution  and  the  accuracy  of  the  solution,  its  closeness  to  the  global  optimum. 

Besides  the  search  algorithm,  an  analysis  scheme  is  also  needed.  The  analysis  module's  task  is  to  provide 
field  solutions  for  the  different  designs  suggested  by  the  search  tool.  Modeling  the  problem  is  one  of  the 
responsibilities  of  the  analysis  module.  Especially  in  shape  optimization,  accurate  modeling  of  the  suggested  design  is 
very  critical.  The  analysis  section  of  the  optimization  environment,  therefore,  must  be  able  to  model  the  problem 
accurately  and  efficiently,  and  it  must  provide  accurate  solutions  for  the  field  parameters.  The  search  relies  on  the 
analysis  done  on  its  suggestion. 


2.  APPLICATION  OF  GAs  TO  DESIGN  OPTIMIZATION 

There  are  two  fundamental  functional  bodies  in  a  design  optimization  process:  the  analysis  tool  and  the 
search  algorithm.  They  continuously  interact,  as  for  every  new  set  of  design  parameters  the  search  algorithm  suggests 
the  analysis  tool,  solves  the  governing  equations  and  determines  the  value  of  a  new  point  in  the  search  space.  The 
genetic  algorithm  performs  the  search  operation  and  the  finite  element  method  carries  out  the  analysis.  A  schematic 
diagram  of  the  procedure  is  given  in  Figure  (1).  The  GA  comprises  all  the  blocks  on  the  left  side  of  the  figure.  The 
analysis  is  shown  as  a  single  block,  which  contains  all  of  the  processes  of  FEM. 

A  GA  starts  the  search  from  an  initial  population  of  a  certain  number  of  members.  This  population  is 
randomly  created  within  the  domain  of  the  search  space.  The  members  of  the  population  are  finite  length  string 
structures  called  chromosomes.  Coding  the  design  parameters  using  a  finite  length  alphabet  forms  these  structures. 
Generally  a  low  cardinality  alphabet  is  used,  such  as  the  binary  system.  The  coding  of  the  parameters  is  entirely 
application  dependent  and  it  is  one  of  the  critical  aspect  of  the  performance  of  a  GA.  For  the  binary  alphabet,  two  of 
the  well-known  ones  are  binary  and  gray  coding.  The  bits  in  the  strings  are  called  genes.  As  every  chromosome  in  the 
population  represents  another  design,  it  is  necessary  to  determine  how  well  fitted  each  design  is.  The  FEM  creates  a 
mesh  for  that  particular  design,  solves  the  governing  equations,  and  informs  the  GA  about  the  performance  of  the 
designed  device  by  returning  the  necessary  values  such  as  magnetic  flux  densities  at  critical  points. 
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From  the  results  of  the  analysis,  the  GA  calculates  the  objective  function  value  for  each  chromosome.  This 
value  is  called  the  fitness  of  the  chromosome  and  plays  a  significant  role  in  the  further  steps  of  the  search.  Also,  an 
average  fitness  is  calculated  representing  the  fitness  of  the  entire  population.  The  decision  on  the  convergence  of  the 
search  is  made  based  on  the  average  fitness  as  indicated  in  Figure  (1).  If  the  search  has  to  continue,  the  GA  creates  a 
new  generation  from  the  old  one.  There  are  three  operators,  which  exclusively  characterize  the  GAs:  reproduction, 
crossover,  and  mutation. 

Based  on  the  fitness  of  a  chromosome  and  the  average  fitness  of  the  population,  the  reproduction  operator 
determines,  rather  randomly,  whether  that  particular  chromosome  will  have  copies  in  the  next  generation,  and  how 
many.  There  are  many  ways  of  designing  this  operator,  however,  the  underlying  idea  is  to  give  the  chromosome  with 
a  higher  fitness  more  chance  to  be  represented  in  the  next  generation  but  leave  the  actual  decision  to  a  random 
variable.  Once  the  reproduction  is  complete,  the  new  generation  is  made  of  copies  of  the  previous  generations’ 
members.  Nevertheless,  there  is  no  new  information.  It  is  time  for  the  chromosomes  to  exchange  information  through 
the  crossover  operator.  At  this  point  the  number  of  chromosomes  in  the  population  has  not  changed. 


Figure  (1)  Genetic  Algorithms  in  Design  Optimization 

The  crossover  operator  forms  pairs  from  the  new  generation  mating  chromosomes  randomly.  For  each  pair 
all  bits  from  a  randomly  selected  position  on  the  string  to  the  end  of  the  string  are  swapped. 

The  third  fundamental  operator  of  GAs  is  the  mutation  operator.  It  occasionally  changes  the  value  of  a  gene 
acting  as  a  protector  against  the  complete  loss  of  some  important  genetic  information  [4].  Both  crossover  and 
mutation  operators  have  occurrence  probabilities.  Compared  to  crossover,  mutation  happens  much  less  frequently. 

After  all  these  operators  perform  their  functions,  the  new  generation  is  made  of  members  who  have  gained 
new  information  through  the  exchange  between  pairs.  The  better  traits  of  the  “parent”  chromosomes  are  earned  along 

to  the  future  generations.  ...  , ,  ... 

Several  practical  issues  that  need  a  special  attention  in  implementing  GAs  to  optimization  problems  will  now 
be  examined.  These  will  contribute  to  achieving  improvements  in  the  convergence  characteristics  of  GAs.  There  are 
three  areas  were  improvements  could  be  achieved.  These  include  the  representation  scheme  as  well  as  the 
reproduction  and  crossover  operators. 
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2.1  Representation  scheme 

The  representation  scheme  is  the  method  of  coding  the  design  variables  into  finite-length  character  strings. 
Generally,  a  binary  alphabet  is  used  as  explained  earlier.  Although  the  trivia!  binary  coding  yields  acceptable  results, 
it  is  reported  that  the  unit-distance  Gray  coding  performs  better  [5].  Assume  the  binary  number  01111111  represent  a 
point  in  a  one  dimensional  search  space.  The  nearest  point  that  can  be  mapped  with  this  representation  scheme  using 
binary  coding  is  1 00000000.  This  small  step  in  the  search  space,  however,  requires  all  8  bits  of  the  chromosomes  to 
change.  Gray  code  is  a  unit  distance  coding  system.  When  the  number  the  code  represents  changes  by  one  unit,  the 
change  in  the  code  is  also  only  one  bit.  This  characteristic  of  the  Gray  code  prevents  the  search  from  getting  trapped 
in  cases  like  the  above  example. 

2.2  Reproduction 

Working  on  the  entire  population,  the  reproduction  operator  creates  a  new  generation  from  the  old 
generation.  Reproduction  in  GAs  is  probabilistic.  Based  on  the  fitness  measure  of  an  individual  and  the  average 
fitness  of  the  population,  the  reproduction  operator  determines  the  number  of  copies  that  particular  individual  will 
have  in  the  next  generation.  The  underlying  idea  in  designing  the  reproduction  operator  is  to  give  the  individual  with 
higher  fitness  more  chance  to  be  represented  in  the  next  generation  but  leave  the  decision  to  a  random  variable. 

2.3  Crossover 

Through  crossover,  chromosomes  exchange  genetic  information.  The  crossover  operator  simulates  the 
recombination  process  of  natural  evolution.  Both  methods  summarized  below  are  derived  from  one-point  crossover 
[6]. 

1)  Two-point  crossover.  As  the  name  suggests,  there  are  two  randomly  selected  crossover  sites.  Only  the  genes 
located  between  these  two  sites  are  swapped.  This  can  be  best  demonstrated  with  an  example.  Suppose  the  coding  is 
binary  with  an  alphabet  A  =  {0,1 }.  Assume  that  before  crossover  two  strings,  a,  and  a2,  are  mated  and  are  represented 
as: 


=  1 10|0101 101110100 
a2=  100|1100011100010 

also  assume  that  the  two  randomly  selected  crossover  sites  for  this  pair  are  3  and  9,  as  indicated  above  with  the 
symbol  |.  After  crossover  the  offspring  become: 

a’y  =  110|110001|110100 

a* 2  =  10010101 101100010 

This  technique  can  be  generalized  to  implement  n-point  crossover  algorithms.  The  bits  between  every  other 
crossover  site  pair  are  then  swapped. 

2)  Uniform  crossover.  Uniform  crossover  eliminates  the  positional  bias  inherent  in  one  and  two-point  crossover 
methods.  There  are  two  main  steps  to  perform  a  uniform  crossover.  The  first  step  is  to  randomly  determine  how  many 
bits  are  to  be  swapped.  Assume  the  outcome  is  k.  Of  course  k  must  be  smaller  than  the  chromosome  length,  /.  In  the 
second  step,  k  locations  on  the  chromosome  are  randomly  selected,  and  the  bits  at  these  k  locations  are  swapped.  The 
genetic  information  exchange  is  still  effective,  and  there  is  no  bias  regarding  the  position. 


3.  PROBLEM  MODELING  FOR  SHAPE  OPTIMIZATION 

The  technique  used  for  the  field  solution  of  the  electromagnetic  problem  is  the  finite  element  method.  The 
geometric  modeling  of  the  problem  establishes  an  interface  that  joins  the  search  and  the  solution  algorithms. 


397 


Therefore  the  efficiency  of  this  interface  is  crucial  to  the  performance  of  the  optimization.  In  shape  optimization 
problems,  each  member  of  the  GAs  populations  describes  the  geometrical  characteristics  of  a  specific  design  that  is 
translated  into  a  finite  element  model  and  then  solved  to  obtain  the  fitness  for  this  particular  individual.  It  is  a 
significant  task  to  determine  the  optimal  set  of  design  parameters.  These  parameters  are  used  to  form  the  outlines  of 
different  regions  of  the  modeled  device.  Curve  fitting  techniques  as  well  as  an  increase  in  the  number  of  points  that 
control  the  shape  would  enhance  the  problem  modeling  for  shape  optimization. 


4.  THE  PREMATURE  CONVERGENCE  PROBLEM 

A  significant  problem  faced  in  the  application  of  GAs,  to  some  problems,  is  premature  convergence.  Relative 
to  the  other  members  in  the  population,  if  some  of  the  average  individuals  have  extraordinary  fitness,  the  mechanics 
of  GAs  may  lead  the  iterations  to  a  premature  convergence,  which  is  defined  as  a  convergence  to  a  sub-optimal  point. 
There  are  two  main  factors  directly  leading  to  this  undesired  problem  in  the  implementation  of  GAs.  The  selective 
pressure  applied  by  the  selection  operator  is  the  first  factor.  The  second  factor  is  in  the  way  the  recombination  or 
crossover  is  implemented.  If  the  identical  parents  and  their  identical  offspring  represent  a  sub-optimal  point,  the 
convergence  is  premature.  Luckily,  there  is  another  operator  that  quietly  helps  resolve  this  situation.  The  mutation 
operator  randomly  alters  genetic  information  of  the  chromosomes  by  toggling  some  genes.  The  occurrence  rate  of  a 
mutation,  however,  is  significantly  low.  Therefore  it  is  not  always  a  cure  for  the  causes  of  premature  convergence. 

In  some  applications  of  GAs,  the  parents  are  paired  strictly  based  on  their  fitness  to  enhance  the  rate  of 
convergence.  Better  individuals  always  mate  with  better  partners.  This  strategy  increases  the  possibility  of  having 
identical  parents,  and  identical  parents  can  only  reproduce  offspring  genetically  equal  to  them.  Genetic  diversity, 
which  is  one  of  the  essentials  driving  forces  of  natural  evolution,  is  severed.  A  safer  approach  is  to  leave  the  selection 
to  a  random  variable  while  still  giving  the  better  individuals  a  higher  chance  of  paring  with  each  other.  The  problem 
of  premature  convergence  is  demonstrated  in  the  following  example. 


5.  HYBRID  IMPLEMENTATION  OF  GAs 

As  the  onlv  search  technique  in  an  optimization  process,  GAs  have  been  applied  to  a  variety  of  design 
optimization  problems  in  electromagnetics  [4,  7-9].  Combining  GAs  with  a  deterministic  gradient  search  method  [10- 
12]  creates  a  hybrid  technique.  Specific  strengths  of  each  algorithm  are  used  at  different  stages  of  the  search  in  the 
hybrid  method. 

GAs  “explore  and  exploit”  the  search  space  to  locate  the  global  optimum  without  being  attracted  to  local 
optima.  They  work  with  the  value  of  the  objective  function  and  do  not  require  derivatives.  These  characteristics 
become  significant  in  optimization  problems  with  non-convex  objective  functions,  encountered  in  magnetostatic 
problems.  The  GA  is  used  here  to  find  the  area  of  the  search  space  which  contain  the  global  optimum  where  the 
objective  function  is  assumed  convex. 

Deterministic  methods,  on  the  other  hand,  fail  for  these  types  of  problems.  Either  the  search  becomes 
impossible  due  to  the  non-differentiability  of  the  objective  function,  or  it  gets  trapped  in  a  local  optimum. 

The  hybrid  technique  employs  the  two  algorithms  at  different  stages  of  the  search.  GAs  initiate  the  search 
and  locate  the  “zone”  of  the  global  optimum.  Once  in  this  zone,  the  gradient  technique  takes  over  [10].  Starting  from 
this  improved  initial  point,  the  search  quickly  converges.  It  is  assumed  that  in  the  vicinity  of  the  global  optimum,  the 
objective  function  is  differentiable  and  is  free  from  local  optima. 


6.  THE  GENERALIZED  OPTIMIZATION  ENVIRONMENT 

When  optimizing  designs  for  practical  electromagnetic  problems,  the  number  of  design  constraints  and 
variables  is  increased  drastically.  The  construction  and  step-by-step  creation  of  an  electrical  system,  in  practice,  is  a 
trial  and  error  process.  The  design  may  lead  to  a  sub-optimal  solution  since  the  success  of  a  design  depends  on  the 
experience  of  the  designer.  It  is  therefore  necessary  to  simulate  the  physical  behavior  of  the  electrical  system  by 
numerical  methods  in  a  generalized  fashion.  To  obtain  an  automated  optimum  design,  numerical  optimization  is 
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necessary  to  achieve  a  well-defined  optimum.  Optimization  requires  that  all  design  goals,  of  a  device,  be  connected 
into  a  single  objective  function  with  all  independent  variables  and  constraints.  In  applications  to  practical  problems, 
effective  pre-processing  and  post  processing  of  data  are  necessary  in  addition  to  high  performance  computing 
capabilities.  To  achieve  this  process,  a  generalized  optimization  environment  is  suggested  in  Figure  (2). 

The  system  architecture  for  this  optimization  can  be  utilized  for  the  computational  creation  of  an  electrical 
device  or  a  system.  It  is  intended  to  be  a  design  and  development  toolbox  system  for  practical  applications  in  an 
industrial  environment.  As  shown  in  Figure  (2),  the  generalized  optimization  environment  involves  four  main 
interacting  blocks.  These  represent  1)  the  computing  platform,  2)  the  analysis  toolbox,  3)  an  optimization  chest,  and 
4)  the  design  engineer’s  observation  and  decision  making  path.  The  optimization  chest  is  a  black  box  of  various 
optimization  techniques.  The  choice  of  an  optimization  procedure  versus  another  is  based  on  the  expectation  that  the 
selected  method  will  work  faster  or  give  best  answers  to  a  particular  problem  rather  than  the  heuristic  choice  of  a 
method  that  is  application  specific.  The  interaction  between  the  modules  is  described  in  a  shared  open  system 
computational  environment  which  will  make  use  of  available  computing  facilities  and  utilize  the  experiences  and 
tools  gained  from  previous  application. 

6.1  The  Computing  Platform 


Figure  (2)  Generalized  Optimization  Architecture  for  Practical 
Design  of  Electrical  Devices 

The  current  improvements  and  increase  in  capabilities  of  desktop  computing  facilities  will  enable  the 
designer  to  utilize  vast  computing  resources  that  are  available  in  the  design  environment.  The  computing  platform 
block  in  Figure  (2)  represents  the  computing  facilities  that  may  be  available  in  the  design  environment. 

In  this  platform,  parallel  computing  machines,  clustered  workstations  and  personal  computers  could  be 
utilized  to  provide  super-computing  capabilities  at  low  cost.  Workstation  farms  connected  to  data  switches  and  data 
communication  networks  could  provide  automated  access  to  clusters  of  workstations  that  could  be  shared  at  distant 
locations  or  at  the  work  place. 
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6.2  The  Analysis  Toolbox 

This  toolbox  includes  a  collection  of  modeling  tools  and  analysis  techniques.  Software  packages  for  FEM, 
BEM.  FDTD  as  well  as  other  techniques  will  be  accessed  concurrently  by  the  Optimization  Chest  to  determine  the 
objective  function  values,  and  by  the  design  engineer  to  observe  the  performance  and  check  the  design  parameters 
interactively.  The  analysis  toolbox  also  include  information  about  the  various  methods  and  their  formulation, 
facilities  for  pre-  and  post  processing  of  input  and  output  data,  mesh  generation,  mesh  density  information,  adaptive 
meshing  and  additional  interfacing  software  to  access  the  various  blocks  in  the  overall  system. 


6.3  The  Optimization  Chest 

This  block  includes  a  collection  of  design  methods.  Each  is  treated  as  a  black  box  with  an  interface  that 
accepts  the  appropriate  problem  specification.  The  techniques  in  this  box  include  evolutional^  algorithms  such  as 
evolution  strategies,  evolutionary  programming,  genetic  algorithms  and  genetic  programming.  It  also  includes 
artificial  intelligence  algorithms  such  as  neural  networks,  simulated  annealing,  expert  systems,  fuzzy  reasoning  and 
adaptive  learning.  In  addition  to  these  emerging  methodologies  standard  search  algorithms  ranging  from  gradient 
methods,  random  search,  hill  climbing  and  biased  random  walk  will  be  also  included  in  the  optimization  chest.  As 
practical  applications  require  many  objectives  that  may  not  be  met  by  one  of  the  methods,  combinations  and 
generalizations  of  these  algorithms  have  proved  to  be  powerful.  Design  sensitivity  techniques  are  also  included  in  the 
optimization  chest. 

6.4  Parametric  Design  in  the  Optimization  Chest 

The  optimization  chest  described  above  is  a  stochastic  blackbox  approach  to  design  optimization.  To  check 
its  performance  in  a  practical  environment,  each  box  representing  a  technique  should  be  based  on  the  same 
assumptions.  These  may  include;  1)  the  values  for  each  parameter  must  be  found,  2)  each  parameter  has  finite  number 
of  possible  values.  3)  combinations  of  parameter  values  may  be  unacceptable,  4)  there  is  a  function  to  give  a  measure 
for  each  set  of  parameters,  and  5)  the  objective  function  is  to  produce  an  optimal  set  of  design  parameter  values  for  an 
acceptable  scoring  measure.  The  choice  of  an  optimization  method  should  be  based  on  which  method  can  work  faster 
or  give  best  answers  to  the  problem  rather  than  if  the  method  will  work  at  all  for  the  given  application. 
Approximating  continuous  parameters  with  discrete  values  will  work  for  most  applications  but  may  increase  the 
problem  dimensionally.  Combination  methods  are  used  to  reduce  the  size. 

6.5  Problem  Specification 

Applving  any  or  all  of  the  design  methods  to  a  given  problem  can  be  successful  with  proper  problem 
specification. 'Some  suggested  methods  to  enable  the  user  specify  the  problem  are  discussed  here.  These  methods 
could  be  coded  in  software  that  will  execute  before  invoking  the  design  process.  The  creation  of  a  design  policy,  in 
the  form  of  a  rule-generating  function  that  produces  a  feasible  set  of  rules,  may  be  developed.  Each  use  of  this 
function  mav  produce  a  different  set  of  rules  for  every  application.  For  this  reason,  the  feasibility  of  the  generated 
rules  must  be  checked.  Formulating  a  function  that  will  be  used  to  determine  the  feasibility  of  the  generated  rules  is 
an  important  task  in  specifying  the  problem.  Such  a  function  will  take  any  generated  set  of  input  and  determines  if  the 
values  satisfy  all  users  imposed  constraints.  The  next  step  in  specifying  the  problem  is  a  scoring  function  that  takes 
the  rules  as  input,  evaluates  it  and  returns  a  measure  of  merit.  In  order  to  evaluate  several  methods,  the  same  scoring 
function  must  be  used.  The  performance  of  any  method  in  the  optimization  chest  could  be  determined  based  on  the 
number  of  objective  function  evaluations,  or  on  the  time  it  takes  to  return  a  feasible  solution  or  a  combination  of  these 
ideas.  The  final  item  in  the  problem  specification  process  is  an  output  function,  which  will  display  the  input  to  the 
optimization  chest.  Here,  other  user  inputs  such  as  relating  the  parameters  to  those  in  a  database  or  empirical  data 
from  previous  design  or  adding  explanations  and  additional  restrictions  on  the  parameters  may  be  added.  This  process 
can  be  used  at  the  end  of  the  search  process  for  determining  if  a  good  design  is  reached.  This  process  can  also  be  used 
during  the  search  as  a  checking  mechanism  to  interrupt  to  see  the  search  result  at  some  point. 
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6.6  Observations  and  Decision  Making 

This  part  of  the  optimization  environment  includes  information  access  in  the  form  of  query  search  database 
in  an  industrial  or  a  research  environment.  This  may  include  interactive  features  to  access  information  on  field  and 
experience  data,  design  aids  and  rules,  empirical  data,  knowledge  base,  cost  tables  and  parts  list  for  off-the-shelf 
design  optimization. 


7.  CONCLUDING  REMARKS 

GAs  provides  a  high  level  of  robustness  by  simulating  nature’s  capability  of  adapting  to  many  different 
environments.  Through  the  application  of  GAs  to  design  optimization  problems,  the  performance  characteristics  of 
the  GAs  are  shown  to  be  powerful  in  solving  optimization  problems  with  high  dimensional  objective  functions 
containing  several  local  minima.  As  they  use  only  the  fitness  values,  GAs  does  not  require  derivatives  or  any  other 
additional  information  about  the  objective  function.  The  performance  of  GAs  can  be  improved  significantly  by 
applying  selective  pressure  and  adequately  providing  a  shape  modeling  technique  to  control  the  range  of  variation  for 
the  shape  to  be  optimized. 

The  generalized  optimization  environment  is  valuable  for  current  and  future  activities  in  computational 
electromagnetics  for  analysis  and  design  of  practical  applications.  This  environment  involve  interacting  blocks  and 
adds  the  input  of  the  design  engineer  to  the  process  and  giving  him  the  final  decision  making  on  the  achievement  of 
an  optimal  design.  The  interactions  between  the  various  blocks  of  the  system  can  be  implemented  in  an  industrial  or  a 
research  environment. 
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Introduction 

While  FDTD  methods,  such  as  that  of  Yee  [1],  are  very  versatile  and  have  found 
widespread  use  in  electromagnetics,  modeling  thin  wires  in  such  algorithms  can  present 
special  problems.  The  wires  are  usually  much  thinner  than  the  desired  cell  size,  and  reduc¬ 
ing  the  cell  size  would  require  also  reducing  the  time  increment.  Several  methods  have  been 
used  to  treat  thin  wires  in  FDTD.  Merewether  and  Fisher  [2]  and  Umashankar  et  al.  [3]  used 
contour  integration  incorporating  the  If  p  field  behavior.  Parks  et  al.  [4]  and  Holland  [5] 
coupled  the  transmission  line  equations  for  the  wire  current  to  the  FDTD  mesh  equations. 
Riley  [6]  and  Mittra  [7]  solved  integral  equations  for  the  wire  current  coupled  to  the  mesh 
fields.  These  methods  usually  are  best  suited  to  wires  that  lie  along  edges  for  the  electric 
or  magnetic  field  in  the  mesh.  When  the  wire  can  be  moved,  it  often  results  in  reduced 
accuracy.  Requiring  wires  to  lie  on  cell  edges  restricts  the  wire  geometry  when  cells  are 
orthogonal,  and  even  with  more  general  meshes  it  requires  generation  of  a  new  mesh  when 
a  wire  must  be  moved.  This  limitation  may  be  a  major  reason  that  FDTD  methods  still  are 
not  nearly  as  popular  as  frequency  domain  integral  equation  and  finite  element  techniques 
for  modeling  wire  structures. 

It  would  be  very  valuable  to  be  able  to  locate  wires  independently  of  the  mesh.  This 
paper  presents  some  efforts  in  that  direction,  considering  the  lateral  movement  of  wires  par¬ 
allel  to  cell  edges  and  also  adjustment  of  the  wire  length  with  respect  to  the  cell  boundaries. 
The  transmission  fine  method  used  for  the  wire  is  similar  to  that  described  by  Holland  [5], 
but  the  averaging  to  couple  the  transmission  line  fields  to  the  mesh  fields  is  done  in  a  way 
that  can  be  generalized  to  wires  moved  with  respect  to  cell  edges.  The  treatment  of  wires 
passing  through  electric  field  edges  is  considered  first,  and  then  is  generalized  to  wires  moved 
from  the  edges.  The  accuracv  when  the  wire  is  moved  is  demonstrated  by  comparisons  with 
NEC  [8]. 

Treatment  of  Wires  on  Cell  Edges 

In  the  method  considered  here  the  transmission  line  equations  for  current  and  voltage 
along  the  wire  are  coupled  with  the  FDTD  equations  of  the  standard  Yee  algorithm  [1]. 
The  transmission  line  equations  for  the  wire  current  are  derived  from  the  <b  component  of 
Maxwell’s  V  x  E  equation  and  the  p  component  of  the  V  x  H  equation 

dHo  dEz  dEp 

(1) 

dEp  =  1 8Hz  _  dH*  (2) 

e  dt  p  d(j>  dz  '  1 

For  current  I  and  charge  Q  on  a  wire  on  the  z  axis,  the  Ep  and  H#  components  for  small  p 
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can  be  approximated  as 


E0  = 


Q 

2tt  ep 


and 


Substituting  for  Ep  and  H&  in  equation  (1)  and  integrating  over  p  from  the  wire  radius  a  to 
the  transmission  line  radius  rt,  with  boundary  condition  Ez(a)  —  0,  yields 


p]n(rt/a)dl  „  ,  In (rt/a)dQ 

2tt  dt  z[Tt)  2  ire  dz 
or,  defining  inductance  Lw  =  (p/2ir)  In {rt/a), 


dJ_ 

dt 


j- E*(r* )- 


l_dQ 
pe  dz 


(3) 


At  this  point  the  transmission  line  radius  rt  is  an  arbitrary  distance  to  the  outer  shell  of 
the  transmission  line  that  couples  to  the  mesh  fields.  It  will  be  found  to  drop  out  in  the 
treatments  that  follow.  Substituting  for  Ep  and  in  equation  (2),  and  averaging  over  <p  to 
eliminate  the  dHz/dd>  term  yields  the  continuity  equation 


dQ_  _  _dl_ 
dt  dz' 


(4) 


To  reduce  equations  (3)  and  (4)  to  discrete  form,  the  current  and  charge  will  be  sampled 
at  points  spaced  by  A z  along  the  wire  with  time  step  At.  Then  Ik  and  Qk  represent  the 
current  and  charge  at  sample  point  k  and  time  step  n,  with  charge  points  displaced  from 
current  by  Ax/2.  With  a  central  difference  approximation  of  the  derivatives,  and  introducing 
an  applied  voltage  Vmc ,  equations  (3)  and  (4)  become 

(5) 

(6) 

These  can  be  written  in  terms  of  a  voltage  by  defining  capacitance  Cw  =  pe/Lw  as 


'l-Qt-  0  + 


At 
A  zLn 


(EZkAz-Vr) 


QT 1 


71+1/2^ 


)■ 


j-n+1/2  rn— 1/2  At 
lk  ~ lk  A  ZLn 


W-V^-^Az  +  Vr) 


(7) 

(8) 


The  simplest  situation  for  solving  the  transmission  line  equations  coupled  to  the  FDTD 
mesh  is  when  the  wire  lies  along  the  Ez  edges.  The  transmission  line  current  drives  the 
mesh  through  the  Ez  update  equation,  and  Ez  from  the  mesh  drives  the  transmission  line 
through  the  E*k  term  in  equation  (5).  E%k  in  equation  (5)  is  interpreted  as  the  mesh  field 
averaged  around  the  outer  shell  of  the  transmission  line  at  radius  rj.  However,  the  Ez  value 
available  from  the  mesh  equations  is  the  field  averaged  over  a  cell  face.  Holland  [5]  deals 
with  this  difference  by  averaging  the  equations  (5)  and  (6),  so  that  E™k  in  equation  (5) 
becomes  the  average  field  over  the  face,  which  is  available,  and  Lw  is  averaged  for  p  from  a 
to  rt  =  y/ Ax  A  ypK.  The  average  inductance  can  be  approximated  as 


Lw 


V  /artln  (r/a)rdr  = 

2t r  J^rdr  2tt 


(9) 
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Holland  also  averages  over  a  rectangular  cell  face  and  applies  an  empirical  correction  factor. 
This  approach  gives  accurate  results,  but  it  is  not  clear  how  to  generalize  this  averaging  for 
a  wire  moved  off  of  the  Ez  edge. 

Another  way  that  was  found  to  work  well  for  coupling  the  transmission  line  equations 
to  the  mesh  fields  with  the  wire  on  Ez  edges  is  to  use  the  exact  inductance  for  rt  =  Ax/2, 
and  drive  equation  (5)  with  an  average  of  the  Ez  mesh  fields 

£&  =  i  +  Kj.+i,k))  (10) 


for  a  wire  at  mesh  coordinates  (iw,jw,  k).  In  comparisons  with  NEC,  equation  (10)  seemed 
to  give  slightly  better  accuracy  than  Holland’s  average  inductance  with  correction  factor, 
and  remained  stable  to  larger  wire  aj Ax.  Another  method  used  by  Riley  [6]  is  to  locate 
the  wire  on  the  Hz  edges  and  let  the  transmission  line  radius  pass  through  the  surrounding 
Ez  points.  Actually  Riley  had  an  unstructured  mesh  and  rt  was  the  average  distance  to  Ez 
samples.  For  cubical  cells,  r*  =  Axf  \/2.  and  E7/k  in  equation  (5)  is  the  average  of  the  Ez 
mesh  fields 

E?k  =  \  (E%xjWyk  +  Eil+ij„,k  +  EZ,jw+i,k  +  Eiu+hjw+Uk)  (11) 


for  the  wire  at  (*„+ 1/2,  j„+l/2,  k).  It  is  not  obvious  how  to  generalize  any  of  these  averages 
for  a  wire  at  an  arbitrary  location  in  the  x-y  plane  while  maintaining  the  same  accuracy.  We 
also  tried  bi-quadratic  interpolation  on  the  3  x  3  edges  surrounding  the  wire  on  an  Ez  edge, 
and  bi-cubic  interpolation  on  the  4  x  4  Ez  edges  surrounding  a  wire  on  a  Hz  edge,  but  the 
results  were  not  satisfactory. 


Wires  Displaced  from  Cell  Edges 

To  generalize  the  wire  location,  we  looked  at  the  behavior  of  the  fields  in  the  mesh  due 
to  the  wire  excitation.  The  field  component  Ez  at  a  distance  0,  Ax  and  2 Ax  from  a  wire 
located  on  the  E,  edges  is  shown  in  Figure  1  for  varying  wire  radius,  with  the  computed 
fields  connected  by  straight  lines.  The  wire  length  was  1.4  m  with  a  cell  size  of  Ax  =  0.035 
m  and  the  center  three  segments  of  the  wire  were  excited  with  1/3  volts  each,  so  that  results 
could  be  compared  for  a  cell  size  three  times  larger  without  changing  the' source  gap  size. 
The  fields  in  Figure  1  were  obtained  using  the  average  field  of  equation  (10)  to  drive  the 
transmission  line.  Similar  results  were  obtained  using  Holland’s  average  for  a  square  cell 
with  correction  term  [5,  eq.  53]  but  that  solution  became  unstable  for  radii  of  0.01  and  0.014 
m.  The  fields  in  Figure  1  represent  the  the  peak  values  adjacent  to  the  center  of  the  wire 
due  to  a  Gaussian  pulse  excitation  with  full- width- half- max  (tfwhm)  of  2  ns.  For  all  solutions 
in  Figure  1  the  wire  current  was  in  good  agreement  with  the  NEC  model. 

Examination  of  Figure  1  shows  that  Ez  varies  with  x  approximately  as  the  average  of 
ln(x/a)  with  the  function  going  negative  for  x  <  a.  This  differs  from  the  average  m  equation 
(9)  where  the  field  is  assumed  zero  inside  the  wire.  The  transmission  line  equations,  m 
effect,  put  a  boundary  condition  Ez{a)  -  0  on  the  fields  that  the  mesh  is  averaging,  but  the 
field  goes  negative  inside  the  wire  as  if  it  were  produced  by  a  filament  of  current  on  the  wire 
axis.  A  simple  model  for  comparison  with  the  mesh  fields  is  obtained  by  averaging  log(p/a) 
over  an  annular  region  extending  from  p  =  max(0,  ro  —  A/2)  to  ro  +  A/2.  This  yields 


Fo(r0,  A,  a) 


_i  +  if  r0  >  A/2 

_  1  +  \n(r2/a)  *  if  0  <  r0  <  A/2 


(12) 
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Table.  1.  Relations  of  average  E.  from  the  mesh  and  cor¬ 
responding  average  of  log(r/a)  for  a  distance  of  0  and  Ax 
from  a  wire  of  radius  a  ( E0 ,  Ey  and  Aq,  Ay,  respectively). 


Wire 
rad.  (m) 

Eo/Ey 

At) /Ay 

(Eo  +  Ey)/ 
(2£o) 

**(£)/ 

Ao 

0.014 

-0.557 

-0.551 

-0.411 

-0.425 

0.01 

-0.169 

-0.143 

-2.48 

-2.963 

0.003 

0.380 

0.411 

1.806 

1.737 

0.001 

0.486 

0.592 

1.380 

1.354 

10“3 

0.826 

0.821 

1.112 

1.111 

10-7 

0.885 

0.886 

1.065 

1.066 

where  n  =  ro  -  A/2  and  T2  =  tq  +  A/2.  The 
average  could  be  computed  over  cell  faces,  but  in 
either  case  it  was  found  that  a  fudge  factor  was 
needed  on  the  region  size  for  best  agreement  with 
the  observed  fields  and  best  agreement  of  the  cur¬ 
rents  with  NEC.  For  a  mesh  of  cubical  cells  with 
size  Ax.  it  was  found  that  the  mesh  fields  near  the 
wire  were  matched  best  by  Fa(p,  fcAx.  a)  with  fc 
approximately  0.78.  This  seems  to  work  reasonably 
well  over  a  wide  range  of  wire  radii  and  cell  sizes. 
When  averaging  over  a  square  cell  face,  rather  than 
an  annular  region,  the  factor  was  approximately 
0.70 Ax  to  shrink  the  side  length.  Holland  arrived 
at  a  similar  correction  by  subtracting  a  constant 
from  his  average  inductance  to  make  it  cross  zero 
at  an  aj Ax  that  matched  a  moment  method  solu¬ 
tion  for  the  wire.  The  factor  of  0.78  for  the  annular 


Fig.  1.  Ez  in  the  mesh  at  distances  0,  Ax 
and  2Ax  from  a  wire  with  radius  a  for  Ax  = 
0.035  m. 


average  was  chosen  for  best  agreement  of  the  dipole  input  impedance  with  NEC,  although 
0.80  would  work  about  as  well.  Any  significance  of  this  factor  is  not  understood  at  this 
point.  However,  it  was  noted  that  fc  =  0.802188  is  the  solution  of  the  equation 


[io(0,  fcAx,  a)  +  Fa(Ax,  fc  Ax,  a)]/2  =  log(Ax/2a)  (13) 


independent  of  Ax  and  a.  That  is,  the  average  of  the  two  averages  is  equal  to  log(p/a)  at 
p  =  Ax/2. 

The  ratios  of  Eq/E\  and  Aq/A\  are  compared  in  Table  1.  where  Eq  and  E\  are  the 
mesh  fields  Ez  at  the  wire  and  one  Ax  away,  and  Aq  and  A\  represent  Fa(p,  /cAx,  a)  at 
p  =  0  and  Ax.  The  correspondence  shows  that  Fa(p,  fcAx.  a)  tracks  the  field  well  over  the 
range  of  wire  radii.  For  the  case  of  Figure  1,  where  Ax  =  0.035  m  and  ifwhm  =  17Ax/c  the 
function  Fa{p,fc Ax,  a)  tracks  the  mesh  fields  reasonably  well  out  to  several  Ax  from  the 
wire,  while  for  the  same  wire  with  Ax  =  0.1  m  and  tfwhm  =  6A x/c  the  average  tracks  the 
mesh  fields  from  0  to  one  Ax.  In  the  last  two  columns  of  Table  1,  the  average  of  the  mesh 
fields  (Eq  +  Ei)/ 2  and  log(Ax/2a)  are  compared,  both  normalized  by  their  average  about 
the  wire.  The  correspondence  is  expected,  since  the  average  of  equation  (10)  works  well  in 
driving  the  transmission  line  equations. 

Using  Fa(p,  fcAx,  a)  as  a  model  for  the  averaged  mesh  fields,  the  local  Ez  at  a  distance 
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p  from  the  wire  can  be  related  to  the  mesh  field  at  the  wire  location  as 

E2M  =  E?(iwJw,  k)  \n(p/a)/ Fa(0,  fcAx,a).  (14) 

Equation  (14)  can  be  used  with  p  =  rt  to  obtain  E*k  driving  the  transmission  line  in  equation 
(5).  However,  equation  (14)  applies  only  to  the  E~z  field  due  to  a  current  on  the  wire.  When 
the  wire  is  excited  with  a  voltage  source,  changing  the  boundary  condition  on  Ez(a).  a  field 
component  due  to  the  derivative  of  charge  is  introduced  that  is  more  slowly  varying  than 
the  form  of  equation  (14).  Hence  the  applied  voltage  should  be  removed  from  the  Ez  mesh 
field  before  applying  the  correction,  and  the  source  term  in  equation  (5)  becomes 

(E?kAx  -  vn  -  [(£& Ax  -  V™)  ln(rt/a)/F„(0,  fc Ax,  a)  +  V™]  -  V, 

=(£?fcAx  -  V;jnc)  ln(rt/a)/i^{0.  fcAx, a). 


Equation  (5)  for  the  transmission  line  current  then  becomes 


■n+l/2  _ 


At 


ta?- 


Axpe  ik 


Ql- 1 


At 

AxL-u 


[A xE%{iw,  jw,  k )  -  l^nc  ln(rt/a)/Fa(0,  fc Ax.  a)j . 


Since  Lw  -  ^ln(rt/a),  the  ln(r4/a)  factors  cancel  to  yield 


7n+l/2  Tn- 1/2  At  \  n  ^ n  ] 

4  Ik  + 


2ttA  t 


Axp,Fa{0,  fcAx, a)  L 


[AxE2(iw,j„,k)-vr]  (15) 


so  that  rt  is  eliminated.  Equation  (15)  is  essentially  the  same  as  that  obtained  by  Holland 
by  averaging  the  equation,  although  the  correction  of  the  average  is  done  in  a  different  way. 

When  the  wire  is  moved  away  from  the  Ez  edge,  Holland  [5]  suggested  splitting  the 
current  by  linear  proportionality  between  the  surrounding  Ez  edges.  Thus  for  wire  current 


k 

k )  =  I£(i  +  1  -  iw)(j  +  1  -  jw)/AxAy  (16a) 

J?(i  +  1J,  k )  =  I£{iw  -  i)(j  +  1  -  jw)/AxAy  (166) 

J2(i,  j  +  1,  k)  =  i£(i  +  1  -  iw){jw  -  j)/AxAy  (16c) 

J?(i  +  1J  +  1,  k)  =  I%(iw  -  i)(jw  -  j)/AxAy  (16d) 


are  the  current  densities  in  the  Ez  update  equations  for  the  wire  located  at  i  <  iw  <  i  +  1 
and  j  <  jw  <  j  +  1.  Holland  suggests  using  the  same  factors  to  interpolate  the  Ez  fields 
from  the  mesh  to  drive  equation  (5).  This  approach  gave  reasonable  agreement  in  the  peaks 
of  admittance,  but  poor  agreement  in  peaks  of  impedance  as  the  wire  was  moved  within  a 
cell. 

The  field  average  in  equation  (11)  gave  good  agreement  with  NEC  for  a  wire  located 
on  an  Hz  edge,  where  the  current  is  split  equally  on  to  the  four  surrounding  Ez  edges. 
Examining  the  mesh  fields  from  this  solution  showed  that  they  do  not  behave  as  an  average 
of  a  ln(p/a)  function  from  the  wire  location,  but  rather  behave  as  a  sum  of  averaged  ln(p/a) 
functions  from  the  driven  Ez  edges.  Generalizing  to  unequal  edge  excitations,  the  field 
averaged  about  a  point  r  due  to  excited  edges  at  r,  would  be  expected  to  have  the  form 

4 

Ez(r)  =  CiF0(|r  -  rf |,  fcAx,  a)  (17) 

i=i 
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where  aSum  =  Ei=i  ai-  Tlie  transmission  a) 
line  radius  n  is  eliminated  in  the  averaging. 

However,  it  would  still  occur  in  the  equa¬ 
tions  involving  voltage,  since  the  transmis¬ 
sion  line  voltage  depends  on  the  character¬ 
istic  impedance  of  the  line.  « 

The  input  impedance  of  a  a  1.4  m  ° 

dipole  modeled  with  cell  size  of  0.035  m  ~ 

is  shown  in  Figure  2  as  the  wire  is  moved  x 

within  a  cell.  The  excitation  was  a  volt-  a 

age  source  spread  over  the  center  three 
segments  (current  nodes).  The  results  in 
Figure  2a.  using  equation  19,  show  good 
agreement  with  NEC  and  independence  of 
wire  position.  Similar  results  were  ob¬ 
tained  for  the  imaginary  part  of  impedance 
and  with  other  wire  radii.  Results  in  Fig¬ 
ure  2b  were  obtained  by  splitting  the  wire 
current  among  the  cell  edges  according  to 
equation  16,  as  done  for  Figure  la,  but  the 
electric  field  driving  the  transmission  line 
equations  was  obtained  as  a  weighted  av¬ 
erage  of  the  surrounding  mesh  fields,  us¬ 
ing  the  average  inductance  of  [5,  eq.  53] . 

Much  more  variation  is  seen  in  the  peaks  of 
impedance  as  the  wire  is  moved.  However, 
when  the  results  of  Figure  lb  are  plotted 
as  admittance,  there  is  only  a  few  percent 
variation  in  the  admittance  peaks  as  the 
wire  is  moved,  similar  to  Holland’s  results. 

Adjustment  of  wire  length 

When  equations  (5)  or  (19)  are  solved 
with  N  charge  (or  voltage)  nodes  and  fixed 
node  spacing  of  Az.  the  effective  length  of 
the  wire  is  NAz.  The  solution  will  include 
jV  +  1  current  nodes,  including  I\  and  In+ 1  which  have  zero  current  and  are  located  at 
Az/2  before  Qi  and  after  Qn,  respectively.  The  length  of  the  wire  can  be  adjusted  if  the 
central  difference  for  the  derivate  of  current  is  replaced  by  a  second-order  approximation 
with  arbitrary  location  of  the  zero  current  nodes.  If  the  points  where  the  current  goes  to 
zero  are  extended  by  S  from  their  normal  positions  on  either  ends  of  the  wire,  the  derivatives 
are 


200  400  600  000 

Frequency  (MHz) 

Fig.  2.  Real  part  of  input  impedance  as  a  1.4  m 
dipole  is  moved  within  the  mesh;  a)  result  of  equa¬ 
tion  (19),  b)  result  using  average  cell  inductance  and 
linearly  weighted  average  of  mesh  fields. 


dJ(z) 


dz 

dl(z) 


Az  —  S 

\z  at  Qi  ~*2Az(J+~Azj 


+  h 


A  z(S  +  2A  z) 


dz 


i  z  at  Qn 


:  -  In 


Az-S 
A  z(6  +  Az) 


-  In-i 


A  z(6  +  2A  z) 


Results  of  adjusting  the  end  locations  by  ±Az/2  from  their  normal  positions  on  either 
end  are  shown  in  Figure  3,  and  are  in  good  agreement  with  NEC.  These  results  used  Az  =  0.1 
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m,  so  the  agreement  with  NEC  is  not  as 
good  as  in  Figure  2. 

Conclusion 

It  was  shown  that  the  accuracy  of  re¬ 
sults  for  wires  moved  laterally  from  mesh 
edges  can  be  greatly  improved  by  talking 
account  of  the  behavior  of  the  field  in  the 
vicinity  of  the  wire.  Also,  the  distance  to 
the  end  of  the  wire  can  be  adjusted  within 
a  cell  by  using  a  general  second-order  dif¬ 
ference  form  for  the  derivative.  Making 
the  wire  location  completely  independent 
of  the  mesh  would  require  the  additional 
ability  to  tilt  the  wire  with  respect  to  the 
edges.  This  seems  to  be  a  considerably 
more  difficult  problem  than  moving  the 
wires  laterally,  since  the  component  of  the 
mesh  field  parallel  to  the  wire  gets  mixed 
with  the  larger  radial  electric  field  due  to 
charge  on  the  wire.  Simply  averaging  the 
mesh  fields  did  not  seem  to  work  well,  except  in  the  case  where  the  wire  was  tilted  in  one 
coordinate  plane,  and  the  mesh  fields  above  and  below  the  plane  of  the  wire,  on  edges  or¬ 
thogonal  to  the  wire  normal,  could  be  averaged.  Further  study  is  needed  to  develop  a  more 
general  capability  to  tilt  a  wire  with  respect  to  the  mesh. 


Fig.  3.  Real  impedance  of  a  dipole  antenna  with 
radius  0.001  m,  modeled  with  14  charge  nodes  and 
A z  =  0.1  m,  comparing  NEC  (solid)  and  FDTD 
(dashed).  FDTD  end  points  are  extended  by  8  for  ef¬ 
fective  lengths  of  1.3,  1.4  and  1.5  m. 
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A  Practical  Method  for  Increasing  the  Speed  and  Stability  of  the 
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A  method  is  investigated  to  speed  up  the  solve  time  of  matrix  problems  which  arise  in  moment 
method  numerical  methods  for  electromagnetic  field  simulations  using  the  mixed  potential  form 
of  the  EFIE  [1,  2],  The  method  hinges  on  transforming  the  matrix  to  a  curl-free  (TM)  and 
divergence-free  (TE)  basis.  A  singular  value  decomposition  is  then  separately  performed  on  the 
two  respective  diagonal  blocks  of  the  matrix.  The  result  is  a  natural  basis  and  testing  space 
that  separates  the  contributions  of  the  TE  and  TM  currents.  For  simulations  over  a  band  of 
frequencies,  there  is  great  potential  for  improvement  of  both  speed  and  stability.  Generating  the 
above  full  domain  boundary  elements  by  performing  the  SVD  on  the  matrix  at  a  central  frequency 
in  the  band,  and  by  extracting  this  matrix  from  those  at  the  surrounding  bands,  the  knowledge 
of  the  singular  values  leads  to  an  effective  preconditioner  over  the  matrices  in  the  band.  This  is 
effectively  applying  the  regularization  developed  by  Vecchi  et  al.  to  non-static,  yet  non-resonant, 
regimes  [3,  4].  Indeed  at  resonance,  the  ill-conditioning  of  the  EFIE  is  more  fundamental.  It 
is  found  that  only  the  dominant  (smallest)  singular  values  of  the  TM  block  need  to  be  retained 
in  order  to  express  the  solution,  the  current  on  the  planar  conductor.  As  such,  a  full  SVD  on 
the  TM  block  is  not  necessary,  only  the  first  several  singular  values  and  their  corresponding 
eigenfunctions  need  to  be  calculated.  These  are  the  currents  that  result  in  slowly  varying  (in 
space)  charge.  Unfortunately,  the  contribution  to  the  solution  from  the  TE  part  of  the  new  basis 
is  more  complicated.  In  addition  to  the  large  singular  value  slowly  varying  TE  current,  many 
more  rapidly  varying  TE  eigenfunctions  contribute  to  the  solution  in  order  to  satisfy  boundary 
conditions  at  the  ports  and  edges  of  the  planar  conductors.  Nevertheless,  by  preconditioning  the 
matrices  within  the  band  and  only  using  several  TM  basis  functions,  the  new  matrices  to  be  solved 
are  roughly  half  in  size  with  conditioning  that  admits  faster  iterative  techniques  in  place  of  the 
standard  LU  direct  solvers. 
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Figure  1:  The  original  banded  MoM  matrix  using  a  2— dimensional  rooftop  basis  for  a  16mm  long, 
.3 mm  wide  1— port  line  suspended  above  an  hypothetical  air  substrate  of  height  .5 mm.  The 
diagonal  elements  are  largest  and  subbanding  is  due  to  typical  numbering  scheme.  Here,  the  strip 
is  discretized  into  3X32  cells,  resulting  in  a  160X160  matrix. 
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GHz 

Figure  2:  The  top  figure  shows  the  condition  number  before  and  after  the  preconditioning.  The 
bottom  figure  shows  that  by  transforming  to  the  SVD  basis  functions  and  inverting  the  singular 
values  to  precondition  the  matrix,  sufficiently  small  error  relative  to  the  original  formulation  is 
maintained.  The  original  solution  is  solved  to  machine  accuracy  using  Matlab’s  LU  direct  solve. 


Figure  3:  The  TE  part  is  purely  an  integral  operator  and  has  rapidly  varying  currents  corresponding 
to  its  lowest  singular  values.  On  the  other  hand,  lowest  singular  values  of  the  integro-differential 
TM  operator  corresponds  to  slowly  varying  charge. 
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Figure  4: 
varying 


Modal  contributions  to  the  solution 


The  solution’s  make-up  in  terms  of  the  spectrum.  The  central  spike  corresponds  to  slowly 
TE  and  TM  components  contributing  to  the  solution  current. 
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The  solution 


An  important  slowly  varying  TM  contribution 


An  important  slowly  varying  TE  contribution 


An  important  rapidly  varying  TE  contribution 


Figure  5:  The  solution,  and  the  dominant  TM  and  TE  contributions.  Where  as  only  the  slowly 
varying  TM  modes  have  a  significant  contribution,  the  solution  calls  on  TE  modes  scattered  about 
the  TE  spectrum 
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Modeling  the  Bifilar  Helix  Antenna  using  NEC  4.1 
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Abstract:  The  Numerical  Electromagnetics  Code  (NEC  4.1)  is  used  to  model  the  radiation  characteristics  of  a  bifilar 
helix  antenna.  The  performance  parameters  of  interest  are  the  antenna’s  boresight  directivity  and  front-to-back 
ratio.  While  using  NEC  4.1  to  model  the  antenna’s  boresight  directivity  as  a  function  of  frequency,  it  was 
discovered  that  NEC  4.1  would  not  converge  to  a  unique  solution  as  a  function  of  the  segment  wire  radius  chosen  in 
the  model.  Numerical  results  are  presented  for  a  bifilar  helix  antenna  with  the  number  of  wire  segments  in  the 
antenna  and  the  segment  wire  radius  as  variables.  It  is  demonstrated  that  NEC  4.1  does  not  converge  to  a  unique 
solution  for  any  chosen  value  of  segment  wire  radius. 


Introduction:  The  bifilar  helix  antenna  can  be  designed  to  provide  moderately  high  directivity  with  good  circular 
polarization  axial  ratio.  The  antenna  is  simple  to  construct,  having  only  two  helical  arms  and  a  single  feed  point. 
Additionally,  directive  radiation  can  be  obtained  without  the  use  of  a  ground  plane  or  reflective  structure.  In  mariy 
applications,  the  bifilar  helix  is  an  attractive  alternative  to  a  monofilar  or  a  quadrifilar  helix  antenna. 

The  initial  objective  of  this  work  was  to  model  the  performance  of  the  bifilar  helix  antenna  as  a  function  of  the 
antenna’s  pitch  ratio  (axial  length  per  tum/diameter)  and  the  number  of  helical  turns.  Particular  emphasis  was  to  be  placed 
on  the  fact  that  the  antenna  could  be  operated  in  a  bi-directional  end  fire  mode  or  a  unidirectional  end  fire  mode. 

To  provide  a  comprehensive  analysis  of  this  antenna,  it  is  necessary  to  describe  the  antenna’s  boresigln  directivity  and 
front-to-back  ratio  over  some  predetermined  range  of  frequencies.  When  attempting  to  model  the  antenna’s  performance 
over  a  wide  frequency  range,  it  was  discovered  that  NEC  4.1m  would  not  converge  to  a  unique  solution  as  a  function  of  the 
antenna’s  wire  radius. 

This  paper  will  present  the  results  of  a  brief  numerical  study  of  the  bifilar  helix  antenna’s  boresight  directivity 
performance  as  a  function  of  frequency.  Front-to-back  ratio  information  is  not  presented.  The  modeling  parameters  varied 
were  the  number  of  segments  used  to  describe  the  antenna  and  die  wire  radius  (a). 


Antenna  Description:  The  antenna  that  was  modeled  in  this  study  was  a  2-ton  bifilar  helix  as  shown  in  Figure  1.  The 
antenna  has  a  diameter  of  6.706  centimeters  and  an  axial  length  of  13.41 1  centimeters. 

To  initially  model  the  bifilar  helix  using  NEC  4.1,  the  antenna  structure  was  broken  into  wire  segments  with  the 
antenna’s  helical  arms  being  subdivided  into  12  segments  per  helical  turn  and  the  antenna’s  straight  wire  sections  being 
subdivided  into  5  segments.  This  resulted  in  a  wire  segment  length  of  1 .823  centimeters  along  the  helical  structure. 

For  numerical  comparison,  the  antenna’s  helical  arms  were  also  subdivided  into  36  and  72  segments  per  helical  turn, 
resulting  in  a  segment  length  of  0.613  centimeters  and  0.307  centimeters,  respectively.  The  antenna’s  straight  sections 
were  subdivided  into  1 1  and  23  segments  for  the  36  and  72  segments  per  helical  turn  configurations,  respectively. 
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Figure  1.  3-dimensional  View  of  the  Bifilar  Helix  Antenna. 


Numerical  Results  -  Directivity  Analysis:  To  evaluate  the  performance  of  the  bifilar  helix  antenna,  the  first  parameters 
of  interest  were  the  antenna’s  horesight  directivity  and  front-to-back  ratio  as  a  function  of  frequency.  The  frequency  range 
of  interest  was  500  to  3000  MHz.  For  the  numerical  analysis,  the  boresight  directivity  was  calculated  every  10  MHz, 
resulting  in  25 1  frequency  sample  points.  Front-to-back  ratio  information  is  not  presented. 

To  determine  the  suitability  of  the  wire  segment  model,  the  wire  segment  length  and  overall  structure  dimensions 
were  compared  to  the  operating  wavelengths  across  the  frequency  band.  Table  1  illustrates  the  dimensional  comparison  of 
the  antenna  relative  to  the  operating  wavelength. 


Table  1.  Antenna  Structure  Dimensions  Compared  to  Operating  Wavelength 


500  MHz 

1750  MHz 

3000  MHz 

Wavelength 

59.96  cm 

17.13  cm 

9.99  cm 

Antenna  Length 

0.2237  X 

0.7829  X 

1.3425  X 

Antenna  Diameter 

0.1118  X 

0.3915  X 

0.6712  X 

Segment  Length  (12  segments) 

0.0304  X 

0.1064  X 

0.1824  X 

Segment  Length  (36  segments) 

0.0102  X 

0.0358  X 

0.0613  X 

Segment  Length  (72  segments) 

0.0051  X 

0.0179  X 

0.0307  X 

The  NEC  documentation  recommends  that  the  wire  segment  length  be  less  than  0.10  X  and  no  smaller  than  0.001  X. 
With  the  exception  of  the  configuration  with  12  segments  per  helical  turn,  this  condition  is  satisfied  across  the  entire 
frequency  band  of  interest. 
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The  wire  radius  (a)  selected  in  the  numerical  model  ranged  from  4.0  x  1C4  meters  to  1.0  x  10  meters.  The  NEC 
documentation  recommends  that  the  wire  segment  length  to  wire  radius  ratio  be  greater  than  8  and  that  2saA,  be  less  than 
j  Table  2  illustrates  the  segment  length  to  wire  radius  ratio  values  and  a  calculation  of  2rcaA.  for  the  different  wire  radii 
used  in  the  analysis  of  the  bifilar  helix  antenna. 


Table  2.  Segment  to  Wire  Radius  Ratio  (R)  and  a  Calculation  of  27ta fk. 


Radius  (a) 

2na  A. 

R  (12  segments) 

R  (36  segments) 

R  (72  segments) 

4  x  FP 

0.0147 

45.58 

15.33 

7.68 

2  x  10“* 

0.00734 

91.15 

30.65 

15.35 

1  xlCT1 

0.00367 

182.3 

61.3 

30.7 

7x10* 

0.00183 

364.6 

122.6 

61.4 

5.96  x  10^ 

0.00022 

3059 

1029 

515 

1  x  10^ 

3.67  x  10* 

18230 

6130 

3070 

1  x  10IU 

3.67  x  lfry 

0.18  x  10’ 

N/A 

N/A 

TiTio*3 

3.67x10'“ 

1.8  x  10,u 

N/A 

N/A 

1  x  10“’ 

3.67  x  10'1J 

1.8  x  10'^ 

N/A 

N/A 

I  x  HT' 

3.67  x  10  15 

1.8  x  10'° 

N/A 

N/A 

Note:  2rcaA.  is  calculated  at  1750  MHz 


With  the  exception  the  largest  wire  radius  and  the  72  segment  per  helical  turn  configuration,  all  of  the  wire  radii 
chosen  meet  the  NEC  documentation  requirements. 

The  first  bifilar  helix  antenna  modeled  was  the  12  segment  per  turn  configuration.  The  boresight  directivity  was 
initially  calculated  for  a  wire  radius  of  0.0004  meters  over  the  entire  frequency  band  of  500  to  3000  MHz.  In  an  attempt  to 
oain  confidence  in  the  numerical  model,  the  wire  radius  was  adjusted  and  the  directivity  versus  frequency  was  recalculated. 
At  this  point,  it  was  evident  that  the  NEC  4.1  calculations  had  not  converged  to  a  unique  solution  and  the  radius  was 
adjusted  again.  No  evidence  of  converge  was  obtained  for  any  value  of  radius  selected.  Figures  2  and  3  present  the 
boresight  directivity  of  the  bifilar  helix  versus  frequency  for  the  various  radii  of  Table  2. 

As  evident  in  Figures  2  and  3,  there  is  only  a  very  small  region  of  frequency  where  the  directivity  appears  to  converge 
as  a  function  of  wire  radius.  As  the  wire  radius  becomes  very  small,  less  than  1  x  1<T0,  there  appears  to  be  more  consistent 
results  in  the  lower  half  of  the  frequency  band  but  this  does  not  hold  at  the  upper  frequencies. 

The  next  step  in  the  analysis  process  was  to  increase  the  number  of  segments  per  turn  in  the  helical  arms  from  12  to 
36  and  then  from  36  to  72.  This  significantly  reduces  the  segment  length  relative  to  the  operating  wavelength.  The  values 
of  the  segment  length  to  wire  radius  ratios  are  well  within  the  requirements  as  specified  in  the  NEC  documentation. 
Figures  4  and  5  present  the  bifilar  helix  directivity  as  a  function  of  frequency  and  wire  radius  for  the  36  segment  and  72 
segment  per  helical  turn  antennas,  respectively.  The  wire  radius  is  only  varied  from  0.0004  meters  to  0.000001  meters.  As 
evident  from  Figures  4  and  5,  converge  is  not  reached  within  this  range  of  radii  values  for  either  configuration.  Apparent 
numerical  convergence  occurs  only  at  the  low  end  and  extreme  upper  end  of  the  frequency  band.  At  1650  MHz,  there  is  a 
spread  of  over  7  dB  in  calculated  boresight  directivity  over  the  range  of  radii. 

To  understand  how  the  number  of  segments,  or  rather  how  the  segment  length  affects  numerical  convergence,  the 
directivity  of  each  configuration  was  compared  at  fixed  radius  values.  Figures  6,  7  and  8  present  directivity  versus 
frequency  calculated  for  each  antenna  configuration  with  wire  radius  values  of  0.0004m,  0.00005  m  and  0.000001  m 
respectively.  From  these  figures,  it  is  evident  that  the  36  and  72  segment  per  helical  turn  configurations  are  m  very  good 
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agreement.  The  12  segment  per  helical  turn  configuration  is  consistent  with  the  other  configurations  but  it  obvious  that  it 
may  not  provide  sufficient  structural  accuracy.  From  these  curves,  it  is  apparent  that  consistent  results  can  be  obtained 
with  a  different  number  of  segments  per  helical  turn  at  a  given  wire  radius.  However,  it  is  interesting  to  note  that  this 
condition  is  not  valid  for  a  given  value  of  segment  length  to  wire  radius  ratio. 


Figure  2.  Directivity  versus  Frequency  as  a  Function  of  Wire  Radius.  Bifilar  Helix  with  12  Segment  per  Turn. 


Frequency  (MHz) 

Figure  3.  Directivity  versus  Frequency  as  a  Function  of  Wire  Radius.  Bifilar  Helix  with  12  Segment  per  Turn. 
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Figure  4.  Directivity  versus  Frequency  as  a  Function  of  Wire  Radius.  Bifilar  Helix  with  36  Segment  per  Turn. 


Figure  5.  Directivity  versus  Frequency  as  a  Function  of  Wire  Radius.  Bifilar  Helix  with  72  Segment  per  Turn. 
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Figure  6.  Directivity  versus  Frequency  as  a  Function  of  the  Number  of  Segments  per  Helical  Turn.  The  Wire 
Radius  is  0.0004  m. 
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Figure  7.  Directivity  versus  Frequency  as  a  Function  of  the  Number  of  Segments  per  Helical  Turn.  The  Wire 
Radius  is  0.00005  m. 
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Figure  8.  Directivity  versus  Frequency  as  a  Function  of  the  Number  of  Segments  per  Helical  Turn.  The  Wire 
Radius  is  0.000001  m. 


Numerical  Results  -  Pattern  Analysis:  In  addition  to  the  evaluation  of  boresight  directivity  as  a  function  of  frequency,  it 
was  important  to  evaluate  the  computed  radiation  pattern  of  the  bifilar  helix  as  the  wire  radius  was  varied.  The  radiation 
patterns  for  the  antenna  were  computed  at  650  MHz  and  1650  MHz  for  the  configuration  with  72  segments  per  helical  turn. 
These  frequencies  were  selected  because  it  appeared  that  numerical  convergence  occurred  at  650  MHz  but  it  did  not  occur 
at  1650  MHz.  Radiation  patterns  were  calculated  for  wire  radii  of  1  x  10"4  m,  1  x  10'  m  and  1  x  10  m.  Radiation  patterns 
are  presented  in  Figures  9  and  10  for  650  and  1650  MHz,  respectively. 

From  these  radiation  patterns,  it  is  obvious  that  good  numerical  convergence  does  exist  at  650  MHz  but  not  at  1650 
MHz.  Although  the  radiation  patterns  at  1650  MHz  for  the  different  wire  radii  have  almost  identical  maximum  directivity, 
the  variation  between  the  calculated  patterns  is  significant. 
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Figure  9.  Radiation  Pattern  of  the  72  Segment  per  Turn  Configuration  at  650  MHz.  Wire  Radius  Varied. 


Figure  10.  Radiation  Pattern  of  the  72  Segment  per  Turn  Configuration  at  1650  MHz.  Wire  Radius  Varied. 


Conclusion  and  Future  Work:  A  numerical  analysis  of  the  bifilar  helix  antenna  has  been  presented.  It  was  demonstrated 
that  with  a  reasonable  structure  size,  segment  length  and  wire  radius,  numerical  convergence  could  not  be  achieved  with 
NEC  4. 1  over  the  entire  desired  range  of  frequency  bandwidth.  As  the  wire  radius  was  varied,  as  much  as  7  dB  variation  in 
computed  boresight  directivity  occurred.  Future  work  should  include  additional  analysis  of  the  same  antenna  with  an 
increased  number  of  segments.  Additionally,  a  similar  analysis  should  be  performed  with  other  commercially  available 
Method  of  Moments  codes. 


References:  [1]  Gerald  Burke.  Numerical  Electromagnetics  Code -NEC  4.  Method  of  Moments.  January  1992. 
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1  Introduction 

Efficient  solution  of  dense  linear  systems  of  equations  arising  in  the  Method  of  Moment  (MoM) 
formulation  is  of  considerable  interest.  While  the  time  needed  to  fill  the  moment  matrix  is  pro¬ 
portional  to  the  square  of  the  number  of  unknowns  n,  the  solution  of  the  system 

ZI  =  V  (1) 

(Z  is  the  impedance  matrix,  /  is  the  unknown  current  vector,  V  is  the  excitation  vector)  using  a 
standard  Gauss  LU  factorization  scheme  requires  0(nz)  operations.  For  small  problems,  the  fill 
time  of  the  matrix  is  usually  higher  than  the  time  it  takes  one  to  factorize  it;  however,  for  large 
n,  the  reverse  is  true  and  factorization  time  dominates  the  fill  time,  and,  techniques  to  reduce  the 
factorization  time  are  highly  sought  after.  In  many  situations,  the  user  prefers  to  use  a  legacy 
MoM  and  speed  up  its  solution,  rather  than  rewriting  the  entire  code  using  the  Fast  Multipole 
or  other  similar  approaches  to  reduce  the  CPU  time.  The  principal  advantage  of  this  approach 
is  that  only  a  small  part  of  an  existing  program  needs  to  be  modified.  To  test  different  solution 
algorithms  and  to  demonstrate  their  ease  of  implementation,  we  have  chosen  to  add  functionality 
to  the  NEC  4  code.  Modifications  of  NEC  2  are  equally  possible  as  both  codes  differ  only  slightly 
in  their  solver  subroutines.  Beside  the  original  LU  decomposition  in  the  NEC  and  LAPACK’s  LU 
decomposition,  three  different  iterative  algorithms  and  four  different  preconditioning  techniques 
have  been  implemented  to-date. 

In  Sec.  2  of  this  paper,  we  describe  the  different  algorithms  and  identify  both  their  strong  and 
weak  points.  New  cards  that  use  the  added  functionality  are  presented  in  Sec.  3.  A  test  case 
involving  a  wiregrid  patch  array  with  1944  unknowns  is  discussed  in  Sec.  4. 

2  Solver  Algorithms  and  Preconditioners 

2.1  LU  Factorization 

The  preferred  algorithm  for  solving  general,  complex,  unstructured  linear  systems  is  the  Gauss 
elimination  or  LU  factorization.  As  a  first  step,  the  matrix  Z  is  factorized  to  yield  lower  and  a 
upper  triangular  matrices  L  and  U,  respectively.  The  unknowns  I  are  then  solved  for  in  two  steps, 
viz.,  forward  and  back  substitution.  Mathematical  libraries,  e.g.,  the  free  LAPACK  library,  provide 
good  and  reliable  routines  using  advanced  techniques  such  as  blocking.  For  comparison  purposes, 
we  have  included  the  LAPACK  routines  ZGETRF /ZGETRS  into  this  version  of  NEC  and  have 
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found  that,  in  general,  they  improve  the  factorization  and  solve  times.  This  is  even  more  so  when 
the  matrix-matrix  multiplication  subroutines  ZGEMM  are  tuned  for  the  appropriate  processor, 
or  when  vendor-provided  BLAS  libraries  are  used  (cf.  Acknowledgements). 

2.2  Iterative  Solver 

For  sparse  matrices,  i.e.,  matrices  where  only  a  few  elements  are  non-zero,  iterative  methods  that 
only  require  matrix-vector  multiplications  involving  the  original,  unchanged  matrix  and  some 
vectors,  are  found  to  decrease  significantly  the  computation  times.  For  full,  diagonally-dominant 
matrices,  the  iterative  solvers  can  reduce  the  computation  time  to  solve  (1)  quite  considerably. 

A  number  of  iterative  algorithms  have  been  published  in  the  literature.  It  has  been  demon¬ 
strated  that  the  quasi-minimal  residual  methods  and  their  transpose-free  variants  have  been  shown 
to  converge  more  smoothly  [1]  than  the  well-known  (bi-)conjugate  gradient  methods.  Fortran 
subroutines  for  various  iterative  solvers  are  available  on  the  internet  (http://www.netlib.org/ 
linalg/qmr).  In  modifying  the  NECsolver,  we  have  incorporated  three  different  iterative  al¬ 
gorithms:  the  Transpose-Free  Quasi-Minimal  Residual  (TFQMR),  the  Quasi- Minimal  Residual 
(QMR)  and  the  Biconjugate  Gradient  (BCG-from-QMR)  algorithms.  All  of  these  algorithms  are 
based  on  the  Lanczos  methods  without  look-ahead,  so  that  the  additional  memory  requirements, 
over  and  above  those  for  the  original  matrix  and  the  right-hand-side  vector,  are  quite  minimal. 
The  various  matrix-vector  multiplications  are  carried  out  by  using  the  BLAS  routines. 

For  the  problems  tested,  the  TFQMR  method  only  used  vector-matrix  multiplications  with 
the  untransposed  matrix.  Furthermore,  the  solution  vector  always  converged  to  the  required  limit 
faster  than  when  the  other  two  algorithms  are  used. 

2.3  Preconditioner 

Preconditioning  techniques  modify  the  original  matrix  Z ,  so  that  the  system  involving  the  resulting 
matrix  Z'  is  easier  and  faster  to  solve.  The  solution  I  of  the  preconditioned  system 

Z’  I  =  V  (2) 

for  the  excitation  V  is  an  estimate  for  the  exact  solution  Ie  of  (1).  Obviously,  Z'  must  represent 
the  original  matrix  Z  at  least  partially,  so  that  I  is  close  enough  to  Ie.  In  the  following  we  propose 
several  schemes  on  how  to  chose  Z'  for  a  given  Z. 

Since  the  preconditioning  of  (2)  introduces  some  error  when  approximating  the  matrix  Z,  I 
will  not  be  the  exact  solution  Ie  that  satisfies  (1).  To  improve  the  accuracy  of  /,  one  employs 
an  iterative  refinement  procedure,  which  uses  the  original  matrix  Z  as  well  as  the  preconditioned 
and  factorized  matrix  Z' .  The  algorithm  obtains  a  series  Ik  for  the  estimate  I  by  solving  for  the 
error  or  (residual)  I4-i  —  ZIk-\  from  the  equation 


Z'Ik  =  -  ZIk-X  =  14 


The  starting  point  for  the  first  estimate  is 

Z'IQ  =  V0  =  V 


(3) 

(4) 
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and  the  final  solution  I  is  derived  from 

/  =  /0  +  Ji  +  ...  +  Jn  (5) 

If  the  preconditioned  matrix  Z'  is  factorized,  each  iteration  necessitates  only  one  forward  and 
one  back  substitution  to  obtain  Ik.  Also,  Vk+1  is  obtained  by  one  matrix- vector  multiplication 
and  one  vector-vector  subtraction.  The  iteration  procedure  is  terminated  when  either  Vk  -  ZIk 
or  Ik  is  sufficiently  small.  Clearly,  both  the  matrix  Z  and  the  factorized  Z'  must  be  stored  in 
this  procedure.  We  therefore  use  preconditioners  that  require  only  small  additional  storage  for  Z'. 
Since  the  cost  for  one  iteration  described  in  (3)  is  only  0(n2),  the  iterative  algorithm  provides  an 
accurate  solution  much  more  rapidly  than  a  direct  solution  if  the  factorization  of  Z'  is  much  less 
expensive  than  the  one  of  Z. 

Physically,  the  different  matrix  elements  Z{j  express  the  fields  sampled  by  the  weighting  func¬ 
tion  i  generated  by  the  basis  function  j.  For  a  local  discretization  scheme  for  straight  wires,  this 
is  the  sampled  field  on  the  segment  i  at  the  position  f,  radiated  by  the  current  element  located 
on  segment  j  at  the  position  r '.  The  interaction  between  the  segments  that  are  separated  by^a 
large  distance  R  =  r-f  is  relatively  weak,  since  the  free-space  Green’s  function  decays  as  1/|J2|. 
The  interaction  between  two  segments  can  be  so  weak  that  the  corresponding  elements  of  Z{j  only 
have  a  very  slight  influence  on  the  solution  I.  There  are  two  possible  schemes  for  neglecting  the 
influence  of  these  elements,  viz.,  the  block  diagonal  and  band  preconditioning.  An  alternative  is 
to  employ  for  the  preconditioned  matrix  of  Z'  the  factorized  matrix  of  Z  obtained  previously  at 
another  frequency.  It  is  expected  that  this  would  yield  good  estimates  for  frequencies  /  close  to 
/';  however,  the  procedure  may  need  many  iterations  when  /  and  f  are  far  apart.  Finally,  the 
last  technique  is  to  reduce  the  matrix  size  and,  hence,  reduce  the  factorization  time,  by  assuming 
a  constant  relationship  for  the  different  elements  of  I. 

For  multiple  excitations  V,  the  iteration  described  in  (3)  must  be  carried  out  for  each  V.  For 
many  situations,  the  total  time  for  a  preconditioned  solver  can  become  higher  than  for  a  complete 
LU  factorization.  The  choice  to  apply  a  preconditioner  must  thus  be  made  by  the  user,  who  must 
decide  if  it  is  advantageous  for  him  to  do  so. 

2.3.1  Block  Diagonal  Solver 

For  scatterers  comprising  multiple  objects,  the  matrix  Z  can  be  partitioned  into  several  blocks. 
Each  diagonal  block  relates  the  currents  on  only  one  scatterer,  while  the  off-diagonal  blocks  express 
the  coupling  between  different  scatterers.  In  [2],  the  authors  have  presented  a  method  to  compress 
the  off  diagonal  blocks  by  incomplete  QR  and  to  include  them  into  the  solution  step.  In  this  paper 
we  propose  to  neglect  them  altogether  in  the  factorization  step. 

For  S  identical  scatterers  with  m  unknowns,  the  complete  factorization  cost  is  O  ((5m)3). 
Using  the  diagonal  blocks  reduces  this  to  only  SO(m3).  For  example,  for  81  scatterers  with  24 
unknowns  each,  the  blocked  factorization  is  6561  times  less  expensive  than  the  complete  factor¬ 
ization. 
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2.3.2  Band  Solver 


In  this  approach  we  neglect  the  interaction  of  segments  farther  apart  than  a  certain  threshold 
distance.  If  the  matrix  Z  is  ordered  such  that  for  each  element  Zij  the  distance  i  -  j  corresponds 
also  to  the  physical  distance  between  the  elements  i  and  j ,  we  can  easily  accomplish  this  by 
choosing  only  a  band  matrix  Z',  which  can  then  be  stored  and  solved  by  appropriate  banded 
matrix  algorithms  such  as  ZGBTRF/ZGBTRS  in  LAPACK. 

The  cost  for  one  band  factorization  for  a  banded  matrix  with  sub-  and  ku  super-diagonals 
is  2nkiku  flops.  With  ki,  ku  =  O.lra,  the  cost  for  a  factorization  is  only  0.03  or  3%  of  the  complete 
solution,  with  an  operational  count  of  2n3/3. 

This  technique  is  particularly  suited  for  scatterer  geometries  for  which  one  can  assume  that, 
at  some  distance,  the  coupling  between  two  segments  is  negligible. 

2.3.3  Modified  Iterative  Refinement  MIR 

To  implement  the  modified  iterative  refinement  algorithm,  the  factorized  matrix  of  a  previous 
solution  at  frequency  /'  is  used  for  the  preconditioned  matrix  Z1  [3].  However,  the  storage  re¬ 
quirements  are  doubled  in  this  case,  which  might  be  prohibitive  for  systems  with  a  large  number 
of  unknowns. 

2.3.4  Pattern  Solver 

Local  basis  and  testing  functions  in  the  Method  of  Moments  are  able  to  model  arbitrary  surfaces. 
This  is  not  only  true  for  general  3-D  shapes,  but  also  for  arbitrary  wire  type  structures.  Therefore, 
often  local  basis  and  testing  functions  are  typically  utilized  in  the  MoM  formulation  even  though 
this  leads  to  a  very  large  matrix.  However,  the  global  functions,  that  can  help  reduce  the  matrix 
size  considerably,  can  be  used  to  model  specific  geometries  very  closely,  though  they  are  less  suited 
for  general  structures.  We  propose  a  solver  algorithm  that  combines  the  advantages  of  both  of 
these  discretization  schemes,  viz.,  the  ability  to  model  arbitrary  shapes  with  local  basis  and  testing 
functions,  and  the  fast  solution  of  a  small  linear  system. 

In  many  cases,  the  spatial  distribution  of  the  current  on  the  scatterer  can  be  predicted  and 
only  the  amplitude  or  the  weight  coefficient  may  be  regarded  as  the  unknonwn.  In  those  cases, 
several  localized  functions  can  be  grouped  together  by  taking  advantage  of  the  known  shape  and 
applying  it  to  the  vectors  I  and  V  in  (1).  The  resulting  equation  is  a  smaller  system  generated 
by  two  matrix-matrix  multiplications  with  the  pattern  matrix  Pp: 

PpZPpI  =  PpV  (6) 

The  resulting  matrix  PpZPp  is  much  smaller  than  the  original  matrix  Z,  and,  consequently  the 
solution  time  is  much  smaller  as  well.  Usually  the  multiplications  in  PpZPp  are  not  carried  out 
explicitly,  because  Pp  is  sparse. 

Without  iterative  refinements,  the  errors  in  the  individual  current  distributions  can  be  high. 
Errors  for  the  near  and  far  field  quantities  are  usually  much  lower  (<C  10  %),  as  these  are  derived 
by  integration  from  the  currents,  a  process  that  tends  to  cancel  out  the  individual  errors. 
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2.4  Condition  Number 

A  measure  of  linear  system  sensitivity  is  the  condition  number 

K(Z)  =  \\Z\\  \\Z-' II  (7) 

defined  for  a  square  matrix  Z.  Several  different  condition  numbers,  that  are  based  on  the  use 
of  different  norms,  may  be  used  for  this  purpose,  though  the  most  frequently-used  are  1,  2  or 
oo  norms.  The  condition  number  as  defined  in  (7),  is  a  norm-dependent  quantity;  however,  all 
condition  numbers  are  somewhat  equivalent,  and  a  system  which  is  ill-conditioned  in  one  norm, 
i.e.,  has  a  high  k ,  is  also  going  to  be  ill-conditioned  in  another  norm  [4]. 

The  determination  of  the  condition  number  in  (7)  involves  the  computation  of  the  norm  of 
the  inverse  of  the  original  matrix;  however,  this  step  is  usually  not  performed.  Computing  the 
Koo  condition  number  is  exactly  three  times  as  expensive  as  solving  the  corresponding  system.  To 
check  if  the  system  in  (1)  is  ill-conditioned,  the  condition  number  can  be  estimated  by  using  the 
LU  factorization  that  involves  only  0(n2)  operations.  The  LAPACK  library  includes  a  subroutine 
to  estimate  the  condition  number  ZGECON.  Since  this  routine  is  based  on  the  LU  factorization, 
it  rarely  underestimates  the  actual  condition  number. 

3  New  Cards 

The  first  card,  the  MS  card  allows  the  user  to  choose  the  appropriate  algorithm  and  to  provide 
necessary  input  parameters.  Additional  input  parameters  for  some  of  these  solvers  are  provided 
by  the  MC  card.  The  condition  number  of  Z  in  (1)  can  be  estimated  by  using  the  LAPACK 
subroutines  [5],  that  are  selected  by  the  third  card,  the  MN  card.  The  format  of  these  three  new 
cards  is  shown  below. 

3.1  MS 

Purpose:  Select  the  algorithm  to  solve  the  linear  system  Z  I  —  V 

Command: 

MS  MSOLVER  MSNLIM  MSNBLKS  MSUNIT  EPS  REFEPS  BWRATE  R4 

II  12 _ 13 _ 14  FI  F2 _ F3  F4 

Parameters: 

II  -  0  -  original  LU  factorization  (default) 

-  1  -  LAPACK ’s  LU  factorization 

-  2  -  TFQMR  iterative  solver 

-  3  -  QMR  iterative  solver 

-  4  -  block  diagonal  solver 

-  5  -  band  solver 

-  6  -  pattern  solver 

-  7  -  MIR  solver 

-  8  -  QMR  from  BCG  solver 
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3.2  MC 

Purpose:  Provide  more  data  for  solver  algorithm. 


Command: 

MC  II  12  13  14  FI  F2  F3  F4 
Parameters: 

II,  12, 13, 14  -  integer  data  for  solver  algorithm 

FI,  F2,  F3,  F4  -  floating  point  data  for  solver  algorithm 
This  card  is  presently  only  used  for  the  block  diagonal  solver  where  II,  ...,  14  are  block  sizes. 

3.3  MN 

Purpose:  Estimate  the  condition  number  of  the  moment  matrix. _ 

Command: 

MN  ICOMPCN 


Parameters: 

ICOMPCN  -  0  -  don’t  estimate  condition  number  (default) 

-  1  -  estimate  condition  number.  This  also  selects  the  LAPACK  LU  factorization 

4  Numerical  Examples 

As  an  illustrative  example,  we  consider  a  9  by  9  array  .of  wiregrid  patches  of  size  0.3  m  by  0.3  m, 
with  a  center-to-center  distance  of  1.0  m.  Each  patch  is  modeled  with  24  unknowns,  which  leads 
to  a  total  of  1944  unknowns.  The  associated  moment  matrix  occupies  58  MB.  We  compute  the 
currents  on  the  patches,  for  a  normally-incident  plane  wave,  and  for  frequencies  of  /  =  299.8  MHz 
and  /  =  500  MHz.  All  examples  are  run  on  a  single  332  MHz  PPC  node  of  an  IBM  SP  system. 
The  fill  time  for  all  of  the  examples  considered  is  approximately  39  s.  The  different  run  times 
aggregate  of  factorization  and  solution  times  achieved  for  this  numerical  experiment  are  displayed 
in  Table  1. 

It  was  found  that  substantial  improvements  in  run  times  can  be  achieved  on  the  IBM  as  well 
as  on  a  Dec  Alpha  system  by  using  LAPACK  routines,  in  conjunction  with  the  tuned  GEMM 
routines  and/or  the  vendor-provided  BLAS  routines  in  the  ESSL  library. 

The  results  for  the  QMR  and  the  BCG  iterative  solver  are  omitted  as  these  two  algorithms 
yield  slower  runtimes  than  did  the  TFQMR  algorithm.  For  low  frequencies  (below  the  patch 
resonances)  the  matrix  is  diagonally  dominant;  hence  the  iterative  solver  converges  and  the  total 
solution  time  is  4.4  times  faster  than  it  is  for  the  original,  complete  LU  factorization.  In  addition, 
the  additional  storage  for  the  work  vectors  is  negligible.  However,  for  /  =  500  MHz,  the  patches 
exhibit  resonance,  and  the  interaction  between  different  patches  becomes  important.  The  matrix 
is  no  longer  diagonally  dominant  when  this  happens,  and  the  algorithm  fails  to  converge. 

The  band  preconditioner  retains  a  sufficient  number  of  off-diagonal  elements  in  both  cases 
to  achieve  convergence  during  the  refinement  steps.  With  this  preconditioner,  the  total  solution 
time  is  17  times  less  than  it  is  for  a  complete  factorization,  and  the  corresponding  figure  is  3.3  for 
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LAPACK.  The  total  memory  needed  to  store  the  two  matrices  is  75  MB  -  58  MB  for  the  original 
matrix,  and  17  MB  for  the  factorized  banded  matrix. 

At  /  =  500  MHz,  close  coupling  between  the  individual  patches  becomes  apparent  in  the 
process  of  using  the  block  preconditioner.  If  we  choose  81  blocks  for  the  matrix,  one  for  each 
patch,  the  algorithm  fails  to  converge.  However,  if  we  include  some  interaction  between  the 
patches  by  increasing  the  block  size  to  72,  216,  or  648,  we  can  again  achieve  convergence.  The 
fastest  solution  time  is  achieved  with  27  blocks  comprising  72  unknowns  each,  and  this  is  26  (5.2 
for  LAPACK)  times  faster  than  it  is  for  complete  LU  factorization.  The  storage  for  the  different 
block  matrices  ranges  from  2.1MB  for  27  blocks  to  6.4  MB  for  3  blocks,  for  a  total  that  ranges 
between  61  MB  to  65  MB. 

To  compute  the  pattern  when  using  this  algorithm  one  needs  some  information  about  the 
current  distribution,  e.g.,  that  obtainable  from  a  previous  solution.  In  our  numerical  experiments 
we  used  the  current  distribution  from  a  previous  run  at  a  frequency  which  is  lower  by  50  MHz.  The 
pattern  used  for  the  central  three  rows  were  derived  by  averaging  the  currents  on  the  elements 
of  the  corresponding  row,  and  this  enabled  us  to  reduce  the  number  of  unknowns  from  1944 
to  1438  (74%).  The  errors  in  the  current  distributions  can  be  relatively  high  if  no  iterative 
refinements  are  carried  out;  nevertheless  near  and  far  field  quantities  only  exhibit  a  maximum 
error  of  2%  and  5.6%  for  the  non-resonant  and  resonant  cases,  respectively.  Additional  storage 
needed  for  the  reduced  matrix  is  32  MB,  with  the  total  being  89  MB. 

The  MIR  algorithm  [3]  is  the  most  memory-demanding  of  all  among  the  ones  we  have  investi¬ 
gated,  and  since  it  requires  that  both  the  complete  factorized  matrix,  as  well  as  the  new  matrix, 
be  held  in  memory,  the  total  memory  requirement  is  twice  that  needed  for  complete  LU  factor¬ 
ization.  For  the  present  problem  this  total  is  119  MB,  which  is  rather  large.  Additionally,  the 
largest  frequency  step  for  the  two  frequencies  of  interest  was  10  MHz,  which  is  relatively  small. 
Other  techniques  for  broad  band  computation,  such  as  the  MBPE  [6],  appear  to  be  more  efficient 
for  handling  this  case. 

5  Conclusions 

We  have  presented  a  number  of  different  techniques  for  speeding  up  the  solution  of  linear  systems 
of  equations  arising  in  the  application  of  the  NEC  code.  These  techniques  either  handle  the  full 
matrix  by  using  different  algorithms,  or  precondition  it  to  speed  up  the  solution.  They  are  not 
general-purpose,  however,  and  the  choice  of  the  algorithm  is  left  to  the  user  who  selects  the 
appropriate  algorithm  via  new  input  NEC  cards. 

The  use  of  publicly-available  subroutines  obtainable  from  the  internet,  enables  one  to  easily 
enhance  the  performance  of  legacy  codes  with  relatively  minor  changes. 


Acknowledgements 

The  use  of  matrix-matrix  multiplication  kernels  (GEMM)  tuned  for  RISC  processors  by  Michel 
Dayde,  Iain  Duff  et  al.  is  acknowledged.  The  package  was  obtained  from  http://www.enseeiht . 
f r /Recherche/Inf o/apo/Numeriq . html . 


434 


Table  1:  Run  times  for  a  9  by  9  array  of  wiregrid  patches 


/ 

=  299.8  MHz 

/ 

=  500.0  MHz 

algorithm 

factor 

solution 

total 

factor 

solution 

total 

in  s 

in  s 

in  s 

in  s 

in  s 

in  s 

original  LU 

378.19 

1.10 

379.29 

376.79 

1.13 

377.92 

LAPACK  LU 

72.76 

0.44 

73.20 

75.47 

0.45 

75.92 

TFQMR 

0.00 

85.76 

85.76 

no  convergence 

band  with  10  %  bandwith 

2.60 

19.53 

22.13 

4.68 

13.16 

17.84 

block  diagonal  (3  blocks) 

10.02 

7.66 

17.68 

9.84 

6.60 

16.44 

block  diagonal  (9  blocks) 

1.06 

10.94 

12.00 

1.08 

13.01 

14.09 

block  diagonal  (27  blocks) 

0.15 

9.74 

9.89 

0.19 

14.29 

14.48 

block  diagonal  (81  blocks) 

0.04 

13.24 

13.28 

no  convergence 

pattern 

33.23 

0.70 

33.93 

34.67 

0.71 

35.38 

MIR 

0.00 

0.00 

44.75 

no  convergence 

Note:  Performance  improvements  between  the  original  LU  factorization  and  LAPACK’s  factorization  vary  with 
CPU,  problem  size,  and  impact  of  the  tuned  ZGEMM  subroutines.  Improvements  on  different  platforms  can  be 
less  spectacular. 
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ABSTRACT 

Modeling  of  moderately  thick  wires  can  often  involve  feedpoint  effects,  which  may  or  may  not  be 
handled  properly  when  compared  to  actual  physical  construction  and  experimental  measurements.  In 
this  paper,  it  is  shown  that  attaching  the  two  feedpoint  wires  on  one  side  of  a  hollow  cylindrical  tube  in 
the  form  of  a  dipole  will  introduce  a  frequency  shift  in  the  impedance  or  VSWR  from  what  NEC4 
would  compute.  A  code  such  as  WIPL,  which  uses  a  combination  of  plates  and  wires  in  an  EFIE 
approach,  is  shown  to  produce  results  which  are  correlated  well  with  experimental  measurements  of  the 
physical  dipole.  Finally,  this  paper  shows  that  a  correction  factor  in  length  can  be  implemented  to 
produce  equivalence  with  NEC4  results  even  for  bent  stubs  at  the  end  of  the  dipole.  This  has 
implications  in  using  codes  such  as  NEC4  for  optimization  in  arrays  and  construction  of  physical 
antenna  elements. 

1.  INTRODUCTION 

This  paper  will  describe  the  modeling  and  measurements  of  a  thick  bent-wire  dipole  that  is  used  as  a 
driven  element  in  a  Yagi  that  forms  one  of  the  elements  of  a  large  circular  array.  This  array  is  called 
the  UHF  Electronically  Scanned  Array  (UESA)  and  is  being  proposed  as  a  replacement  for  the  current 
mechanically  rotated  array  which  is  attached  on  top  of  an  aircraft  for  Navy  radar  applications  similar  to 
the  AWACS  airborne  radar  array.  It  was  desired  to  know  two  effects  in  the  construction  of  such  a 
dipole  element:  a  bend  in  the  dipole  at  the  ends  of  the  wire  to  allow  its  space  usage  without  overlap  of 
nearby  neighbors,  and  the  attachment  of  the  feedline  to  the  center  feedpoint  of  the  dipole.  The  NEC4 
computer  antenna  modeling  program  has  been  used  exclusively  to  optimize  the  Yagi  array  as  a  whole 
in  a  package  called  NECOPT  [1].  The  authors  recently  received  another  modeling  package  called 
WIPL  [2].  This  WIPL  program  allows  the  user  to  interactively  define  the  geometry  of  any  metallic 
structure  as  a  combination  of  wires  and  plates  and  uses  the  EFIE  formulation  for  solution  of  the 
currents,  impedance,  patterns,  etc.  The  user  can  further  check  his  data  by  using  a  built-in  3-D  drawing 
capability  of  the  structure.  The  program  also  provides  the  user  with  a  variety  of  list  and  graphic  output 


436 


capabilities,  including  2-D  and  3-D  graphics.  The  authors  only  had  the  reduced  version  of  the  WIPL 
program  which  has  the  unknowns  limited  to  100  which  was  satisfactory  for  the  modeling  described  in 
this  paper  using  symmetry.  The  GNEC  package  was  used  for  all  NEC4  modeling  in  this  paper  and  has 
extensive  pre  and  post-processing  under  Windows  for  model  input  and  display  of  results  [3]. 

2.  DIPOLE  ANTENNA  DESCRIPTION 

The  dipole  in  this  paper  is  a  moderately  thick  and  hollow  conductor  being  3A  inches  in  diameter  and 
having  a  half-length  of  7.43  inches  in  both  the  straight  and  bent  configurations.  It  is  designed  for 
operation  around  400  MHz  and  it  was  found  experimentally  that  the  resonant  frequency  could  be 
altered  simply  by  changing  the  feedpoint  attachment  Also,  the  dipole  ends  must  to  be  bent  to 
accommodate  its  fitting  into  the  array  without  overlapping  with  its  nearest  neighbors  on  each  side, 
since  it  is  being  used  in  a  circularly  symmetric  array.  This  bend  also  cause  a  slight  change  in  the 
resonant  frequency  of  the  isolated  dipole  element. 

The  feedpoint  can  either  be  attached  with  some  form  of  screw  connection  on  one  side  of  the  hollow 
cylindrical  tube  (in  this  case  copper  tubing)  or  at  several  attachment  points.  The  NEC  applied  E-field 
voltage  source  model  assumes  that  the  electrical  field  is  completely  uniform  in  the  gap  region,  which  is 
very  similar  to  what  would  occur  in  a  capacitor  between  the  plates.  Since  the  optimization  used  NEC4, 
it  was  required  to  use  another  code  and/or  measurement  that  could  give  the  same  impedance  as 
computed  by  NEC4  after  an  adjustment  was  made  to  the  actual  dipole’s  length.  It  was  found  that  this 
equivalent  NEC4  dipole  could  be  achieved  in  practice  with  a  bent  stub  length  of  2.125  inches  instead 
of  2.75  inches  as  was  used  in  NEC4. 

The  WIPL  code  can  actually  model  the  cylindrical  wire  with  plates  and  then  connect  a  wire  between 
the  hollow  tubes,  much  like  that  in  the  actual  physical  dipole  connection.  This  is  shown  in  Figure  1  for 
both  the  straight  and  bent  configurations  with  only  half  of  the  dipole  shown  in  each  case.  Symmetry  is 
employed  in  this  code  to  model  the  other  half.  The  wire  can  also  be  seen  connected  to  one  side  of  the 
hollow  tube.  Only  half  of  the  dipole  is  modeled  in  the  other  plane  as  well  to  conserve  unknowns  in  the 
version  of  this  code  that  was  provided  to  the  authors.  Three  total  types  of  feed  arrangements  were  tried 
for  this  model  and  are  shown  in  Figure  2.  The  Type  1  feed  is  as  already  described  and  the  Type  2  feed 
uses  2  wires  and  the  Type  3  feed  uses  4  wires  (the  two  additional  feeds  provided  by  a  PMC  boundary 
in  the  XOY  plane)  as  shown  to  produce  a  more  uniform  field  distribution  which  should  be  more  similar 
to  NEC4. 
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Type  1  Type  2  Type  3 


Figure  2 

3.  RESULTS 

Results  are  shown  in  the  form  of  both  VSWR  and  input  impedance  for  several  different 
implementations  of  the  basic  configurations  described  above.  WIPL  results  for  a  straight-wire  model 
are  shown  in  Figure  3  for  the  three  different  feed  types,  respectively.  Similarly,  WIPL  results  for  the 
bent  stub  case  of  2.75  inches  are  shown  in  Figure  4  for  the  same  three  different  feed  types, 
respectively.  It  was  found  that  the  bent  stub  length  could  be  adjusted  to  give  an  impedance  essentially 
the  same  as  NEC  using  its  applied  E-Field  source.  The  stub  length  for  this  condition  was  found  to  be 
2.125  inches  instead  of  the  2.75  inches  as  used  in  the  NEC4  optimization.  These  results  are  shown  in 
Figure  5  for  WIPL  and  Figure  6  for  NEC4.  This  WIPL  result  for  the  shorter  bent  stub  is  only  done  for 
the  type  1  feed  case  since  this  is  the  case  actually  used  in  practice.  A  measurement  was  taken  for  the 
bent  short  stub  case  which  was  built  and  tested  in  the  lab  for  a  stub  length  of  2.125  inches  instead  of 
2.75  inches  and  the  resulting  VSWR  is  shown  in  Figure  7. 

4.  CONCLUSIONS 

It  has  been  shown  that  the  WIPL  code  using  the  EFIE  formulation  can  accurately  model  the  case  of  a 
dipole  fed  by  connecting  a  feedline  to  one  side  of  a  hollow  cylindrical  conductor.  Furthermore,  it  has 
been  also  shown  that  actual  physical  and  WIPL  impedance  results  can  be  made  to  be  equivalent  to  that 
of  a  NEC4  model  by  simply  shortening  the  total  length  of  the  dipole  (in  this  case  a  bent  stub)  to  form 
an  equivalent  dipole  which  can  be  used  in  optimization  studies  based  on  NEC4.  Agreement  between 
experimental  measurements  and  both  independent  codes  can  then  be  obtained. 
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Introduction 

In  the  early  eishties  when  I  was  preparing  an  overview  [1]  on  VLF,  LF  and  MF  antennas  I  wrote  that  the  present  day 
availability  of  microcomputers,  combined  with  sophisticated  antenna  modelling  codes  (method  of  moments),  provides  the 
tools  needed  to  more  rigorously  determine  the  characteristics  of  MF  broadcast  antennas.  And,  in  the  case  of  multi-element 
broadcast  arrays,  the  calculated  currents  in  amplitude  and  phase,  which  result  in  the  desired  pattern,  should  compare  more 
exactly  to  those  measured,  thus  facilitating  initial  set-up  and  reducing  the  time  required  to  achieve  the  desired  pattern  by 
field  measurement”. 

But  my  first  attempt  at  numerically  modelling  MF  broadcast  antennas  was  not  very  satisfactory.  The  tools  I  had  were 
MININEC  and  NEC-2.  While  NEC-2  does  account  for  a  Summerfeld-Norton  ground  beneath  the  antenna,  the  wire  model 
must  not  touch  the  ground.  I  devised  a  trick  to  simulate  "ground  connection",  viz.  using  elevated  resonant  radials.  This 
launched  me  into  a  detailed  study  of  elevated  radials  [2,3,4].  More  recently,  since  now  I  am  now  licensed  to  use  NEC-4  (I 
use  EZNEC  pro  4D),  I  can  model  antennas  with  buried  radial  ground  systems.  However,  my  initial  studies  seemed  only  to 
result  in  new  problems  to  be  concerned  about. 

Why  am  I  concerned  with  modelling  MF  broadcast  antennas?  One  reason  is  a  current  interest  in  near  fields.  This  prompted 
a  correspondence  exchange  with  Valentin  Trainotti,  LU1ACM,  since  Tino  measured  near  fields  for  two  types  of  MF 
broadcast  antenna  systems  [5],  While  I  agreed  in  principle  with  his  views,  I  disagreed  with  some  of  his  statements,  and 
with  some  of  the  conclusions  reached  [6].  Also,  since  I  am  a  subscriber  to  the  NEC  Listserver  <dave@ee.ubc.ca>,  this 
put  me  in  touch  recently  with  three  MF  broadcast  engineers,  Sylvio  Damiani,  Brazil,  Alberto  Fassio,  Italy,  and  Grant 
Bingeman.  KM5KG,  with  Continental  Electronics,  Dallas,  TX.  The  stations  in  Brazil  and  Italy  employ  folded-unipole 
radiators,  an  antenna  type  not  used  in  North  America.  And,  the  station  in  Brazil  employs  elevated  radials,  at  quite  a  low 
height,  and  a  hish  transmitter  power  (100  kW),  which  raises  a  concern  about  the  magnitude  of  the  near  fields  beneath  the 
elevated  radials --  but  that  is  a  subject  for  follow  on  studies.  Finally,  in  December  1997, 1  received  a  request  for  advice  and 
help  in  resolving  a  number  of  MF  antenna  problems  in  Australia  and  New  Zealand  [Stephen  White,  private 
communications,  1997], 

This  paper  is  a  progress  report,  which  summarizes  some  of  my  recent  work.  "Progress",  since  at  the  time  of  writing, 
preparations  are  underway  to  conduct  experiments  in  Taupo,  on  the  North  Island,  New  Zealand,  which  will  compare 
performance,  for  the  same  MF  tower,  at  the  same  site,  contrasting  six  elevated  radials  with  120  -  0.75  wavelength  (0.75  X) 
buried  radials. 

Antenna  Systems  without  a  Buried  Radial  Wire  Ground  System 

A  dipole-type  vertical  radiator  can  be  used  as  a  MF  broadcast  antenna  system,  without  the  120  buried-radial-wire  ground 
system.  Trainotti  [5]  uses  such  an  antenna  system  (see  Fig.  1),  which  he  calls  an  asymmetric  vertical  half  wave  dipole. 

I  have,  in  previous  publications,  referred  to  such  an  antenna  as  a  ground-plane  type  antenna  with  drooping  radials.  In  the 
arrangement  used  practically,  the  top  70  metre  (m)  section  of  a  130  m  high  tower  is  fed  against  four  sloping  radial  wires, 
each  about  a  quarter  wavelength  long  (70  m  for  a  frequency  of  1  MHz).  The  angle  between  the  sloping  radials  and  the  tower 
is  45°.  Since  the  tower  is  not  broken  he  uses  a  cage  feed,  arranged  as  a  folded  unipole,  with  four  drop  wires,  for  the  top 
section.  The  tower  is  sitting  on  the  ground,  "grounded"  for  lightning  purposes  by  a  ring  of  ground  rods.  We  have  assumed 
that  the  tower  itself,  for  purposes  of  modelling,  is  sunk  into  the  ground  to  a  depth  of  5  m. 

I  have  numerically  modelled  this  antenna  system  (using  NEC  4D)  to  compute  ground  wave  field  strength  and  impedance.  I 
compute  the  field  strength  at  1  km  E(1  km)  =  127.7  dB(iV/m,  for  a  transmitter  power  of  50  kW;  and  Za  =  148  +  j  9  ohms. 
The  measured  FS  was  128  dBpV/m,  and  the  antenna’s  impedance  is  said  to  be  between  150-200  ohms.  The  broadcast 
station  site  has  very  good  ground  conductivity  (a  =  40  mS/m,  s  =  30). 
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Fig.  1  Asymmetric  half-wave-dipole-type  antenna, 
with  folded  unipole  feed.  Sketch  above  shows  detail 
on  this  method  of  feed. 


Conventional  Buried  Radial  Wire  Ground  Systems 

The  ground  systems  used  by  MF  broadcasters  are  based  on  the  results  of  the  (classical)  experimental  studies  of  Brown 
et.al.  [7],  I  have  modelled  and  re-modelled  (making  changes  to  the  model)  the  pipe  monopole  system  used  by  Brown  and 
his  colleagues,  frequency  3  MHz,  in  an  attempt  to  obtain  an  agreement  with  measure  results.  But  I  have  to  report  that  I  did 
not  find  much  agreement.  I  have  computed  impedance  vs.  tower  height  and  vs.  length  and  number  of  radials;  and  field 
strength  in  the  same  way  as  FS  was  measured  (Brown  etal.  measured  FS  at  0.3  miles  and  assumed  a  1/distance  (1/d)  rate  of 
change  to  1  mile,  the  field  strengths  plotted).  I  have  also  looked  at  the  decrease  in  current  along  a  radial,  but  the  agreement 
was  not  good. 


These  authors  mention  that  the  ground  conductivity  at  the  field  site  was  2  mS/m,  I  suspect  (judging  from  the  photographs 
shown)  that  the  conductivity  was  much  higher,  but  computing  results  for  different  conductivities  did  not  help.  We  have  to 
conclude,  if  we  believe  in  NEC  4,  and  in  Maxwell's  equations,  that  their  measurements  were  too  crude  to  expect  to  find 
agreement  However  the  trends  are  right. 

Most  MF  broadcast  stations  (based  on  the  results  of  Brown  et.  al.)  typically  employ  120  radials  of  length  0.25  X  to  0.4  X. 
Broadcast  towers  in  Australia  and  New  Zealand  (which  we  discuss  below)  employ  radials  of  length  150  m  to  200  m,  buried 
to  a  depth  of  0.2  m. 


Before  we  present  any  results  from  our  files  of  numerical  modelling,  let  me  describe  the  model  we  use.  Clearly  if  one 
looks  at  a  real  antenna,  we  do  not  put  a  source  on  a  thick  tower,  we  connect  a  wire  to  a  thick  tower  and  the  source  is  on 
that  wire.  A  source  on  a  wire,  connecting  to  the  multi-wire  radial  ground  system  seemed  to  be  a  better  method  to  model  the 
buried  ground  screen.  That  is  the  connection  to  the  ground  system  is  a  heavy  gage  wire.  The  lower  end  of  this  wire 
touches  the  ground.  Next  we  connect  a  wire,  having  the  same  segment  length  as  this  wire  on  which  we  placed  the  source, 
which  slopes  downward  at  45°,  and  connects  to  the  buried  horizontal  radial,  see  Fig.  2.  The  first  segment  on  the 
horizontal  radial  has  the  same  segment  length  as  this  sloping  wire,  and  we  use  a  segment  taper  option  (available  with 
EZNEC  pro  4D),  for  the  horizontal  radial. 


Fig.  2  Sketch  illustrates  a  method  of  modelling  a 
buried  radial  wire  ground  system  connecting  to  a  thick 
tower. 
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I  have  modelled  towers  without  guys.  and  towers  with  guys,  fed  against  a  PEC  ground;  and  towers  without  guys  fed  against 
120-200  m  lono  radials  buried  0.2  m.  average  ground.  Before  presenting  results  which  show  in  detail  impedance  vs. 
frequency  (or  tower  height  in  wavelengths),  let  us  look  (for  reference)  at  the  differences  for  the  case  of  a  single  frequency, 
between  these  models.  ‘'Consider  a  120  m  high  tower,  diameter  1.58  m  (I  have  measured  data  for  such  a  tower,  for  two 
installations.  Fairview  and  Highcliff.  NZ,  see  below).  The  antenna's  impedance  at  625  kHz  (h fX  =  0.25)  for  this  tower 
without  guys  (PEC  ground)  Za  =  57  +  j  38  ohms,  Eu  =  263  mV/m;  with  guys  (PEC  ground),  see  Fig.  4,  Za  =  65  +  j  85 
ohms.  E^  =  269  mV/m  —  and  with  120-200  m  long  radials  buried  to  a  depth  of  0.2  m  Za  =  65  +  j  40  ohms,  Eu  =  254 
mV/m.  The  unattenuated  field  strength  at  1  km,  for  1  kW,  has  been  calculated  from  the  FS  at  300  m,  assuming  a  1/d  rate 
of  change. 

In  what  follows  (Figs.  3  and  5)  we  assume  a  PEC  ground.  Previous  work  has  shown  differences,  when  comparing 
various  curves  of  impedance  versus  frequency,  or  physical  height  H  in  electrical  degrees.  Since  the  differences  between  the 
curves,  measured  for  different  installations,  or  measured  data  compared  with  early  prediction  methods,  could  apparently  be 
brought  into  better  agreement  by  shifting  one  of  the  curves  by  an  amount  dependent  on  height,  it  was  speculated  that  the 
differences  are  probably  due  to  stray  effects,  such  as  base  insulator  capacitance,  capacitance  of  guy  insulators,  lighting 
transformer,  and  etc.  [1]. 

Stephen  White  has  provided  me  with  many  tables  of  impedance  measurements  for  a  number  of  towers  of  different  heights 
and  tower  diameters.  We  will  consider  here  measurements  for  two  towers,  Fairview  and  Highcliff,  NZ.  One  tower  is  quite 
a  bit  taller  compared  with  the  other,  but  the  height/diameter  ratios  are  about  the  same  (H/D  =  76).  These  towers  were 
mounted  over  an  extensive  radial  ground  system,  described  above. 

In  Fig.  3  we  compare  calculated  base  impedance  for  a  tower  with  no  guys,  with  measurement  The  continuous  curve  is 
our  computed  result.  The  data  points  for  Ra  are  plotted  as  circles;  squares  for  Xa.  Open  circles  and  squares,  and  closed 
circles  and  squares  refer  to  data  for  the  two  installations.  The  agreement  between  measurement  and  computer  simulation  is 
not  very  good. 


Data  for  Fairview  and  HighdifT 
Ra  open  and  dosed  rirdes 
Xa  open  and  dosed  squares 


Fig.  3  Simulation  impedance 
versus  tower  height  in 
wavelengths  for  a  tower  with 
no  guys,  compared  with 
measurements  at  two  locations, 
Fairview  and  Highcliff,  New 
Zealand. 


If  we  include  guys  in  the  model,  broken  by  insulators,  see  Fig.  4  (30  pF  capacitors  for  the  break-up  insulators,  and  26  pF 
capacitors  for  the  high  voltage  insulators  where  the  guy  wires  connect  to  the  tower),  we  obtain  pretty  good  agreement,  see 
Fig.  5.  While  the  differences  between  the  curves  computed  versus  measured  is  greatest  near  the  half-wave  resonant  height, 
it  should  be  noted  that  there  is  also  a  difference  between  the  measured  data  for  this  tower  height. 


Elevated  Resonant  Radials 

To  illustrate  some  of  the  pertinent  factors  and  expectations  if  elevated  radials  are  used,  and  to  validate  NEC  4D  we  consider 
here  a  two  tower  array,  MF  broadcast  station  KGGN,  operating  on  890  kHz,  located  near  Gladstone,  MO.  We  are  fortunate 
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to  have  the  detailed  proof  of  performance  report  for  this  station.  I  have  already  published  some  results  of  this  study  [4], 
but  ACES  provides  a  different  forum,  and  many  of  you  will  not  have  seen  my  Communications  Quarterly  article. 


KGGN  employs  two  vertical  radiators  of  60  m  height  (physical  height  64  electrical  degrees),  69.3  m  (74  electrical  degrees) 
apart,  on  a  bearing  of  N  69°  E  True.  The  towers  are  guyed,  uniform  triangular  cross  section,  face  width  0.457  m.  The 
ground  system  consists,  for  each  antenna,  of  six  counterpoise  wires  arranged  at  60°  intervals  for  symmetry,  86  m  long. 
The  radii  wires  are  elevated  5  m  above  ground  until  the  radials  come  to  a  point  5  m  from  each  tower  base.  At  this  point, 
they  slope  downward  at  an  approximately  45°  angle,  terminating  in  insulators  which  are  attached  to  the  concrete  base  pier. 
The  base  height  of  the  towers  is  1  m  above  ground.  Note:  the  radials  are  too  long  to  be  resonant,  resonant  length  for 
radials  at  a  height  of  5  m  over  average  ground  is  80.86  m.  The  two  counterpoises  are  not  connected  to  each  other  (and 
cannot  accidentally  touch)  except  at  the  base  of  each  tower,  where  a  copper  strap  runs  along  the  tower  line  and  through  the 
transmitter  building.  The  copper  strap  is  for  lightning  protection,  tying  the  phasor,  transmitter,  collection  rings  of  each 
counterpoise,  mains  breaker  panel,  and  telephone  gas  protectors  to  a  common  earth.  For  our  model  we  have  assumed  that 
the  connection  to  earth  is  by  means  of  a  10  m  ground  rod  —  our  modelled  antenna  array  is  sketched  in  Fig.  6.  Tower  1, 
the  tower  on  the  left,  carries  a  current  of  1.0  Z0°  amperes;  and  for  Tower  2  the  current  is  0.92  Z107.50  amperes. 
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Fig.  6  The  model  used  to 
numerically  simulate  performance 
for  MF  broadcast  station  KGGN 
890  kHz,  Gladstone,  MO,  which 
uses  an  elevated  radial  wire  system. 
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The  proof  of  performance  documents  field  strengths  for  the  nondirectional  (ND)  tower,  measurements  made  at  many 
locations  in  eight  azimuthal  directions,  in  the  distance  range  0.3  to  40  kilometers;  and  in  the  azimuthal  direction  of 
maximum  gain"  the  field  strengths  for  the  ND  tower  and  directional  array  were  measured  at  the  same  locations,  in  the 
distance  range  3  to  40  km. 

Gladstone  is  NNE  of  Kansas  City,  MO.  Notice,  see  Fig.  7,  that  excepting  for  80.5°  T  and  138°  T,  the  radial  paths  are 
all  over  rather  rough  terrain.  Some  of  these  radial  paths  pass  through  residential  and  urban  areas.  Clearly  this  should  result 
in  a  scatter  in  the  field  strengths  measured.  And  indeed  there  is  a  scatter,  not  shown  here,  which  is  too  great  to  validate  a 
comparison  with  computed  field  strengths. 


Fig.  7  Map  showing  terrain  features 
surrounding  the  MF  broadcasting  station 
KGGN  890  kHz,.  Gladstone,  MO, 
highlighting  three  azimuthal  directions 
pertinent  to  our  analysis  for  comparison 
with  the  measured  data. 


In  Fig.  8  I  show  a  plot  of  FS  for  the  ND  tower  with  elevated  radials  vs.  distance  for  the  two  azimuthal  directions, 
80.5°  T  and  138°  T,  which  are  the  only  two  paths  that  lie  over  rural  fairly  open  and  level  land  (see  Fig.  7).  In  retrospect 
we  should  have  plotted  only  the  data  for  the  80.5°  T  azimuth.  The  computed  curve  (the  continuous  line)  has  been 
calculated  for  average  land  (a  =  5  mS/m,  £  =  13),  a  conductivity  which  is  consistent  with  the  measurements.  I  have  not 
fitted  a  curve  to  the  measured  data.  Notice  that  the  measured  data  points  cluster  quite  well  about  the  theoretical  curve.  I 
conclude  that  indeed  NEC-4D  accurately  predicts  the  ground  wave  field  strength. 


Now  let  us  look  at  measured  field  strengths  along  the  azimuthal  direction  corresponding  to  the  main  beam  direction 
(249°  T)  for  the  directional  array.  In  Fig.  9  I  show  the  results  for  the  ND  tower  in  this  direction  —  again  the  continuous 
curve  is  the  theoretical  curve  (the  same  curve  in  Fig.  8),  and  the  filled  in  circles  the  measured  data.  Notice,  we  will  make 
reference  to  this  below,  the  sudden  break  in  the  agreement  between  measured  and  predicted  field  strengths  in  the  distance 
range  4-10  km.  This  is  considered  to  be  due  to  the  rough  terrain  over  which  the  wave  passes  to  reach  the  measurement 
locations. 


In  this  same  figure  we  compare  the  theoretical  field  strength  curve  for  the  directional  pattern  (the  continuous  line)  with  the 
measured  data  (the  open  circle  data  points).  Again,  notice  that  the  data  points  in  the  range  4-10  km  lie  below  the  theoretical 
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Distance  (kilometers) 


Fig.  8  Measured  ND  field  strengths  versus  distance 
(open  circles)  for  KGGN,  for  the  two  best  azimuthal 
directions  (80.5  degrees  T  and  138  degrees  T)  —  see 
map  in  Fig.  7.  The  continuous  curve  is  the 
theoretical  field  strength. 


curve.  In  fact  the  only  data  point  that  lies  close  to  the  theoretical  curve  is  that  measured  at  3.3  km;  which  fortunately  is  the 
location  where  the  field  strength  for  the  ND  tower  is  rather  close  to  the  theoretical  curve. 

You  may  consider  that  this  is  not  very  satisfactory  agreement  between  measurement  and  numerical  modelling.  So  let  us 
compare  measurement  with  theory  in  another  way.  The  array  gain,  gain  over  the  ND  tower,  can  be  quite  accurately 
determined  from  the  ratio  of  measured  field  strengths  (since  ND  and  directional  field  strengths  were  measured  at  the  same 
locations)  in  the  distance  range  3-12  km  (nine  values),  see  Fig.  9.  The  median  value  for  the  measured  array  gain  is 
3.8  dB .  Our  predicted  array  gain,  is  4  dB.  We  can  also  compare  the  front/back  ratio  for  the  array,  predicted  vs.  measured, 
28.9  dB  compared  with  28.8  dB. 


Fig.  9  Measured  ND  field  strength  versus 
distance  (closed  circles)  for  KGGN  for  the  azimuth 
of  maximum  gain  (249  degrees  T);  and  field 
strength  for  the  2-tower  directional  array  (open 
circles).  The  continuous  curves  are  the  theoretical 
field  strengths. 


Effect  of  Ground  Conductivity 

Certainly  ground  conductivity  has  a  dominating  effect  on  antenna  performance  and  on  propagation.  I  have  calculated  (using 
EZNEC  pro  4D)  parameters  for  a  1000  kHz  quarter  wave  monopole  with  4  resonant  radials,  for  two  radial  heights  3m  and 
5m;  and  for  120  quarter  wavelength  buried  radials  (depth  -0.2m)  —  for  two  conductivities,  good  ground 
(o  =  5mS/m,  e  =  13);  and  very  poor  ground  (a  =  0.5  mS/m,  e  =  3);  see  Table  1.  The  gain  G  is  the  space  wave  gain. 
The  field  strength  at  1  km  E  (1  km)  is  the  predicted  ground  wave  field  strength  at  that  distance,  for  a  measurement  height  of 
1 ,5m.  and  a  transmitter  power  of  1000  watts. 
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For  the  very  poor  conductivity  case,  be  careful  how  you  interpret  the  field  strength  at  1  Ian.  In  the  distance  range  200m  to 
1000m  the  field  strensth  falls  off  on  a  linear  curve  on  a  log-log  plot,  but  faster  than  1/distance.  If  I  take  the  field  strength 
at  200m  as  an  unattended  value,  and  assume  that  the  field  strength  falls  off  as  1/d  (unattenuated  value),  the  unattenuated 
field  strengths  at  1  km  for  radial  heights  5m,  3m  and  -0.2m  (120  radials)  are  254  mV/m,  249  mV/m  and  263  mV/m. 
Clearlv  propagation  effects  dominate.~even  at  distances  <  1  km.  But  proof  of  performance  measurements  do  not  always 
show  measurements  so  close  in  to  the  antenna  —  and  for  a  multi-tower  antenna  array  this  distance  range  is  too  close  in  — 
the  beam  is  not  formed. 

Conclusions 

In  conclusion,  if  we  wish  to  seek  a  comparison  between  measurement  and  simulation  for  a  monopole  over  sparse  radial 
wire  around  systems,  we  will  need  to  conduct  this  experiment  in  an  open  field  site,  using  modem  measurement  equipment.. 
And  "the  ground  conductivity  must  be  measured.  Field  strength  is  strongly  influenced  by  the  local  environment,  such  as 
power  lines  and  fences.  The  measurements  clearly  reveal  the  difficulty  in  determining  an  accurate  field  strength  vs.  distance 
curve  with  little  scatter,  needed  for  comparison  with  numerical  modelling,  which  assumes  a  flat  smooth  earth.  Only  if 
sufficient  detail  is  given  in  the  proof  of  performance  to  sort  the  data  and  establish  that  indeed  there  can  be  a  rather  good 
agreement  between  measured  and  predicted  field  strengths  are  the  measured  data  useful  for  validating  computer  simulation. 
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Table  1 

Effect  of  ground  conductivity,  see  text  for  details. 


Height 

Ground 

Conductivity 

(number) 
of  radials 

Good 

Very  Poor 

5m 

(4  radials) 

Za 

Gain 

E  (1  km) 

38.5 -j  1.7 
1.89  dBi 

293.5  mV/m 

37.6 -j  11 
-1.85  dBi 

181  mV/m 

3m 

4  radials) 

Za 

Gain 

E  (1  km) 

40.2 

1.8  dBi 

288.5  mV/m 

40.1  -  j  10.4 
-1.99  dBi 

176.4  mV/m 

-0.2m 

(120  radials) 

Za 

Gain 

E(lkm) 

37.7 +j  9.4 
1.9  dBi 

292.7  mV/m 

36.5  +j  5.6 
-1.4  dBi 

188.8  mV/m 
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ABSTRACT 

For  successful  mission  operations,  the  U.  S.  Coast  Guard’s  new  87-foot  coastal 
patrol  boat  relies  on  effective  radio  communications  systems.  The  Department  of 
Engineering  at  the  U.  S.  Coast  Guard  Academy  has  been  tasked  to  assess  the  high 
frequency  (H.  F.)  communications  system  on  these  patrol  boats.  Radiation  patterns  are 
predicted  using  Numerical  Electromagnetic  Code  (NEC).  Personnel  radiation  hazard 
potential  is  evaluated  through  on-site  measurements,  and  verified  using  computer 
modeling  with  NEC  near-field  calculations.  Results  of  these  measurements  are 
presented,  which  indicate  that  exposure  levels  are  below  current  I.E.E.E.  standards. 
Antenna  placement  is  also  evaluated  using  NEC. 

1.0  INTRODUCTION 

During  the  summer  of  1998,  the  United  States  Coast  Guard  began  accepting 
delivery  of  a  new  class  of  coastal  patrol  boats.  The  new  cutters  are  87  feet  long,  and  will 
be  used  as  replacements  for  the  aging  fleet  of  82-ft,  95-ft,  and  110-ft  patrol  boats.  When 
the  construction  project  is  completed,  some  50  new  cutters  will  have  entered  the  fleet  of 
high  utility  search  and  rescue,  environmental  enforcement,  and  marine  safety  cutters. 
Because  of  the  diverse  missions  for  which  these  cutters  are  used,  it  is  imperative  that 
the  communications  suite  performs  optimally.  The  Antennas  group  of  the  Coast  Guard 
Academy  was  tasked  with  generating  an  electromagnetic  model  of  the  new  87-ft  coastal 
patrol  boat.  Once  the  model  was  created,  we  were  asked  to  predict  the  H.  F.  performance 
of  the  cutter.  While  performing  this  analysis,  personnel  radiation  hazards  were 
assessed.  During  June  1998,  data  were  collected  aboard  the  cutter  BARRACUDA  in 
Eureka,  CA.  to  validate  the  computer  simulations.  During  the  fall  of  1998,  modifications 
to  the  ship  design  were  implemented  on  the  second  patrol  boat  to  strengthen  the  ship’s 
mast,  and  to  change  the  location  of  some  of  the  VHF  and  UHF  antennas.  These  changes 
are  presently  being  evaluated  using  a  modified  wire  grid  on  the  computer  model. 
Computer  predictions  will  be  verified  by  physical  measurements  on  one  of  the  cutters. 
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2.0  DEVELOPING  THE  MODEL 

To  develop  the  electromagnetic  model  of  the  Coastal  Patrol  boat,  we  generated  a 
two-foot  grid  from  drawings  of  the  ship.  This  grid  spacing  was  chosen  to  give  good  detail 
in  the  model,  while  limiting  the  number  of  segments  to  a  reasonable  total  (1766 
segments).  This  model  was  used  to  predict  radiation  patterns  in  the  far  field  at  several 
frequencies  in  the  H.  F.  region. 


Figure  1:  View  of  the  NEC  model  of  the  coastal  patrol  boat  containing  1766  segments  with  2-ft  grid 
spacing. 

To  assess  the  near-field  effects  on  the  inside  of  the  wheelhouse,  the  grid  size  on 
the  pilot  house  was  reduced  to  one  foot.  Also,  the  interior  deck  of  the  wheel  house  was 
added  to  the  model  using  the  one-foot  grid  spacing.  This  grid  density  increased  the  total 
number  of  segments  to  3349.  The  tighter  grid  spacing  was  selected  for  two  reasons. 
First  we  wanted  to  make  certain  that  the  windows  on  the  bridge  behaved  like  windows 
and  not  just  part  of  the  grid,  and  second,  the  closer  grid  spacing  on  the  bridge  should 
yield  better  results  for  the  near  field  calculations.  The  cost  of  evaluating  the  fields  with 
the  higher  density  grid  was,  of  course,  increased  computation  time. 

3.0  NEAR-FIELD  MEASUREMENTS 

Near-field  measurements  were  gathered  on  the  bridge  of  the  patrol  boat  in  June 
1998.  A  spatial  grid  was  established  on  the  port  side  of  the  bridge  in  the  vicinity  of  the 
windows  Measurements  of  electric  field  strength  were  recorded  at  approximately  one- 
half  meter  separation  fore  and  aft,  and  at  three  different  distances  into  the  bridge  from 
the  inside  edge  of  the  port  bulkhead.  Three  vertical  heights  were  established  above  the 
deck  of  the  wheelhouse;  1)  the  bottom  edge  of  the  windows,  2)  the  center  of  the  windows, 
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and  3)  the  top  of  the  windows.  This  grid  was  used  so  that  the  measurements  could  be 
compared  to  calculations  of  electric  field  from  the  NEC  program  and  model. 


Figure  2:  View  of  the  NEC  model  containing  3349  segments  with  the  one-foot  grid  spacing  on  the 
wheelhouse. 

Near-field  measurements  were  made  using  a  Hewlett-Packard  HP11941A  close- 
field  probe  (H-field  loop)  and  HP  8595A  Spectrum  Analyzer.  To  simplify  data  collection 
and  recording,  a  simple  data  acquisition  program  was  written  to  record  the  x,  y,  z 
coordinates,  and  signal  strength.  The  coordinates  were  typed  into  the  lap-top  computer, 
and  the  field  strength  was  captured  from  the  spectrum  analyzer  over  the  Hewlett- 
Packard  interface  bus  (HPIB)  with  a  PCMCIA  HPIB  interface  unit.  The  programming 
environment  used  for  this  data  acquisition  was  MATLAB®  MATLAB®  was  selected 
because  of  the  ease  of  manipulating  and  plotting  the  data  in  this  environment.  Data 
were  recorded  as  arrays  of  numbers.  Each  array  element  included  coordinates  of  the 
datum  point  and  the  signal  strength  observed  at  that  point.  The  data  were  recorded  at 
every  point  on  the  measurement  grid  at  each  of  four  frequencies;  4012  kHz,  6022  kHz, 
7547  kHz  and  12,560  kHz.  These  frequencies  were  selected  as  representative  Coast 
Guard  operating  frequencies  in  the  H.  F.  spectrum.  We  were  especially  interested  in  the 
radiation  levels  inside  the  bridge  on  the  port  side  of  the  ship.  This  is  the  area  of  the 
bridge  where  the  operators  will  be  spending  most  of  their  time  while  the  ship  is 
underway.  Specifically,  the  commanding  officer’s  chair  is  just  inside  the  wheel-house 
next  to  the  H.  F.  whip.  The  navigator  stands  on  the  port  side  of  the  control  console  just 
forward  of  the  Captain’s  chair,  and  the  helmsman  sits  amid-ship  just  forward  of  the 
location  of  the  H.  F.  whip.  The  H.  F.  transceiver  control  unit  is  to  the  left  of  the 
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Captain’s  chair  on  the  side  of  the  chart  table.  All  interior  surfaces  on  this  ship  are 
aluminum. 

The  measured  signal  strength  levels  presented  in  Figure  3  are  based  on  a 
continuous  transmitter  power  of  100  watts  continuous  carrier  wave  into  the  antenna 
coupler.  This  represents  a  worst  case  signal  level  from  the  point  of  view  of  radiation 
hazards.  On  this  class  of  vessel  in  normal  Coast  Guard  operations,  all  transmissions  are 
single  side-band  (SSB)  transmissions  in  the  H.  F.  band.  The  duty  cycle  of  SSB  is 
normally  something  on  the  order  of  twenty-five  percent  of  the  maximum  transmitter 
power.  In  Figure  3,  the  level  representing  the  IEEE  standard  for  maximum  exposure 
limit  is  based  on  an  exposure  time  of  six  minutes,  and  a  constant  signal  level.  Thus,  the 
actual  exposure  level  for  personnel  aboard  this  ship  will  be  significantly  below  the  levels 
that  constitute  a  hazard.  However,  as  ethical  engineers,  it  is  the  opinion  of  the  authors 
that  there  is  cause  for  concern  for  the  personnel  on  the  ship,  and  that  reasonable 
attempts  to  reduce  the  radiation  levels  would  be  prudent. 

Observations  at  the  other  frequencies  showed  similar  behavior  as  the 
measurement  point  moved  around  on  the  test  grid.  Higher  frequencies  tended  to 
produce  more  points  where  the  levels  observed  exceeded  the  existing  IEEE  standard 
exposure  limits.  Again,  adjusting  the  levels  for  duty  cycle,  there  is  never  a  frequency  or 
position  on  the  ship  where  the  levels  represent  eminent  danger  to  personnel,  but  indicate 
that  engineers  should  take  steps  to  reduce  potential  exposure  levels  whenever  possible. 

4.0  NEAR-FIELD  CALCULATIONS 

To  calculate  the  near-field  strengths  inside  the  bridge  area,  points  matching  the 
measurement  grid  were  used  with  the  NEC  near-electric  field  commands.  Computer 
runs  were  initiated  for  each  frequency  and  over  the  grids  points  actually  observed.  The 
results  of  these  computations  were  then  compared  to  the  measurements  discussed  in  the 
previous  section.  Figure  4  presents  the  results  of  one  of  these  simulations  at  4.0  MHz. 
Additional  results  are  pending,  and  will  be  available  at  the  ACES  conference  in 
March  1999. 


5.0  H.  F.  ANTENNA  PLACEMENT 

The  H.  F.  antennas  on  this  patrol  boat  are  standard  23-foot  fiberglass  whip 
antennas  fed  at  the  bottom.  The  original  placement  of  these  antennas  was  on  either  side 
of  the  pilothouse.  The  actual  whip  is  mounted  on  a  pedestal  approximately  three  inches 
away  from  the  side  of  the  pilothouse.  Operationally,  the  H.  F.  antenna  coupler  would  not 
tune  properly  in  the  frequency  range  of  seven  to  ten  megahertz.  We  have  been  asked  to 
evaluate  this  problem,  and  also  evaluate  the  performance  of  the  antenna  mounted  on  top 
of  the  forward  storage  locker.  This  work  is  just  beginning  at  this  writing.  We  will  have 
results  to  present  at  the  ACES  conference  in  March  1999. 
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6.0  CONCLUSIONS 


The  H.  F.  communications  system  onboard  the  U.  S.  Coast  Guard’s  new  87-foot 
coastal  patrol  boat  has  been  evaluated.  Using  both  computer  modeling  and  physical 
measurements,  radiation  levels  inside  the  bridge  of  the  ship  have  been  found  to  be  safe 
in  that  the  levels  are  below  the  hazard  levels  set  forth  in  the  existing  IEEE  standards. 
Additionally,  through  computer  modeling,  the  radiation  characteristics  of  the  H.  F. 
antennas  have  been  evaluated.  As  a  result  of  this  work,  the  location  of  the  H.  F. 
transmit  antenna  has  been  moved  to  a  more  favorable  location. 


Radiation  @  6022  kHz  Measured  at  height  of  center  of  windows 


Distance  FWD  (ft) 


— x=  4*  into  bridge  —  Hi  —  x-13‘  into  bridge  --s  -  -x=36*  into  bridge  —  -K  —  Standard  C95.1  | 


Figure  3.  Graph  showing  near  electric  field  strength  measurements  on  the  interior  of  the  port  side  of  wheel 
house  at  a  height  of  the  center  of  the  windows.  Three  distances  into  the  bridge  are  shown.  Also  included 
is  a  line  showing  the  maximum  radiation  limit  at  a  frequency  of  6022  kHz  according  to  IEEE  Standard 
C95.1.  All  signal  levels  shown  assume  100  watts  of  R.  F.  power  at  the  antenna  coupler  input. 


7.0  ACKNOWLEDGEMENTS 

We  wish  to  thank  our  sponsors  for  supporting  this  work.  The  USCG  Research  and 
Development  Center,  Groton,  CT.  provided  the  initial  project  idea,  and  supplied  funding 
for  the  measurements  conducted  on  Barracuda.  The  USCG  Telecommunications  and 
Information  Systems  Command  (TISCOM),  Alexandria,  VA.  supported  the  development 
of  the  computer  model.  Finally,  the  Commandant,  office  of  acquisitions  supported  the 
work  involving  moving  of  the  HF  antenna. 


453 


1.  "IEEE  Standard  for  Safety  Levels  with  Respect  to  Human  Exposure  to  Radio 
Frequency  Electromagnetic  Fields,  3  kHz  to  300  GHz,  IEEE  C95. 1-1991,'  published  by 
the  Institute  of  Electrical  and  Electronics  Engineers,  Inc.,  New  York,  NY,  1992. 

2.  Burke,  Gerald  J.,  "Numerical  Electromagnetics  Code  -  NEC-4  Method  of  Moments, 
Part  I:  User’s  Manual,"  January  1992,  UCRL-MA-109338. 


Principle  Author  may  be  contacted  at: 

Michael  E.  McKaughan 
U.  S.  Coast  Guard  Academy 
Academic  Division  (de) 

27  Mohegan  Avenue 

New  London,  CT.  06320-8101 

860.444.8538  (Voice)  860.444.8546  (FAX) 

email:  mmckaughan@exmail.uscga.edu 


454 


Extending  a  MoM/PO/UTD  Hybrid  Method  by  an 
Automatic  Selection  of  the  Computational  Methods 
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Abstract  —  We  present  a  neural  network  based  approach  which  automatically  assigns  one  of 
the  computational  methods  MoM,  PO  or  UTD  to  a  specific  region  of  a  scattering  body,  so  that 
a  subsequent  radiation  or  scattering  analysis  can  be  performed  by  a  hybrid  method  combining 
these  techniques.  The  automatic  selection  is  based  on  geometric  considerations  and  allows  users 
who  are  not  familiar  with  the  theoretical  background  and  limitations  of  the  different  numerical 
techniques  to  successfully  apply  the  hybrid  method  to  a  variety  of  CEM  problems. 


1  Introduction 

Highly  advanced  numerical  methods  and  powerful  computer  codes  are  available  today  for  the 
computation  of  electromagnetic  fields.  There  is  also  a  trend  towards  hybrid  methods,  which 
combine  different  numerical  techniques  in  order  to  extend  the  range  of  applicability.  However, 
it  should  be  avoided  that  such  computer  codes  can  be  used  only  by  well  trained  and  specialized 
persons.  For  instance,  EMC  engineers  or  antenna  designers  should  be  able  to  concentrate  on  the 
solution  of  their  specific  problems  and  should  have  CEM  (computational  electromagnetics)  tools 
at  their  disposition  in  order  to  find  an  optimal  solution  or  to  gain  physical  insight.  They  cannot 
be  expected  to  be  familiar  with  all  the  details  and  limitations  concerning  the  applicability  of  e.g. 
MoM  (method  of  moments),  FDTD  (finite  difference  time  domain),  FEM  (finite  element  method), 
UTD  (uniform  theory  of  diffraction),  or  PO  (physical  optics).  A  basic  requirement  of  all  CEM 
codes  should  be  to  give  some  feedback  to  the  users  and  to  warn  them  when  some  of  the  basic 
assumptions  are  violated,  e.g.  when  the  UTD  is  applied  to  an  electrically  small  body. 

In  this  paper  we  consider  the  computer  code  FEKO,  which  represents  an  implementation  of 
a  MoM/PO/UTD  hybrid  method  (of  course  any  of  these  techniques  can  be  used  alone).  For 
unexperienced  users  it  is  a  difficult  task  to  determine  the  appropriate  method  to  be  used  for 
specific  regions.  Therefore  an  automatic,  neural  network  based  selection  is  presented.  The  more 
general  application  of  artificial  intelligence  and  expert  systems  for  CEM  was  already  discussed 
in  [1-3]. 

In  Section  2  we  introduce  the  basic  idea  of  the  hybrid  method.  A  general  representation  of  the 
geometry  is  described  in  Section  3,  and  Section  4  focuses  on  the  parameter  extraction  and  the 
application  of  a  neural  network.  Some  examples  are  presented  in  Section  5. 
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2  Description  of  the  hybrid  method 


The  general  idea  of  the  hybrid  method  can  be  explained  with  the  help  of  the  symbolic  representa¬ 
tion  in  Fig.  1.  If  the  radiation  from  the  antenna  on  the  roof  of  a  car  shall  be  investigated  at  a  high 
frequency,  one  possible  approach  might  be  to  model  the  antenna  as  well  as  a  small  area  around 
the  antenna  with  the  accurate  but  time  and  memory  consuming  MoM.  The  remaining  parts  of 
the  roof  are  taken  into  account  by  PO  or  the  IPO  variant  (improved  PO  with  correction  terms, 
e.g.  [4])  and  the  other  parts  of  the  car  are  assigned  to  the  UTD-region.  The  coupling  between  the 
different  methods  is  accurately  taken  into  account,  see  e.g.  [4-6]  for  details  of  the  hybrid  method. 

The  assignment  of  the  different  computational  methods  to  the  different  regions  requires  some  basic 
understanding  of  the  accuracy  and  limitations  of  the  methods  and  also  their  CPU-time  and  mem¬ 
ory  requirements.  The  selection,  which  strongly  depends  on  the  frequency  under  consideration, 
shall  be  automated. 


3  General  representation  of  the  geometry 

One  basic  requirement  of  the  automated  selection  of  an  appropriate  method  is  that  the  definition 
of  the  geometry  by  the  user  and  the  model  generation  are  independent  of  the  method  that  will  be 
assigned  only  later  to  a  particular  region. 

To  illustrate  our  implementation  consider  the  simple  structure  in  Fig.  2  (a)  consisting  of  two  flat 
plates  with  one  common  edge.  The  user  just  specifies  the  plates  through  the  corner  points.  If  it 
turns  out  later  that  the  UTD  shall  be  used  in  this  region,  then  the  polygonal  surfaces  remain  as 
depicted  in  Fig.  2  (a).  The  wedge,  edge  and  corner  information  for  treating  the  diffraction  effects 
is  extracted  automatically. 

On  the  other  hand,  the  MoM  and  also  our  PO  implementation  require  a  subdivision  of  the  structure 
into  electrically  small  triangular  patches.  This  is  also  done  automatically  (Delaunay  triangulation, 
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(b)  Subdivision  into  51  tri¬ 
angular  patches 


(c)  Subdivision  into  647  tri¬ 
angular  patches 


Fig.  2:  Example  for  a  simple  geometry  consisting  of  two  polygonal  shaped  flat  plates  (a)  with  automatic 
subdivision  into  triangular  patches  for  the  MoM  and  PO  at  different  frequencies  (b,c). 


e.g.  [7,8])  without  any  interaction  by  the  user  and  depending  on  the  frequency  more  or  less  triangles 
are  generated,  see  Fig.  2  (b,c). 


4  Application  of  neural  networks 

Even  though  FEKO  can  handle  dielectric  bodies  with  either  MoM  or  PO,  the  description  here 
shall  be  restricted  to  metallic  scatterers.  For  metallic  wires  the  MoM  is  always  selected,  whereas 
metallic  surfaces  are  subdivided  into  regions  (no  limitation  in  size)  and  a  unique  computational 
method  is  later  assigned  to  each  region.  For  instance  in  Fig.  1  the  hood,  trunk,  wings  etc.  might 
represent  different  regions. 

As  illustrated  in  Fig.  3,  for  each  region  a  set  of  parameters  are  determined: 

•  the  surface  A  of  the  region 

•  minimal  source  distance  d  to  the  region  (d  oo  for  plane  wave  excitation) 

•  average  radius  of  curvature  g  of  the  region 

•  surface  Apr0j  of  the  projection  of  the  region  onto  a  plane  perpendicular  to  the  direction  of 
incidence 

•  tallness  parameter  s  =  where  AcircIe  represents  the  area  of  the  smallest  circle  enclosing 
the  projection  of  the  region 

These  values  are  normalized  with  respect  to  the  wavelength  (d,  g)  or  the  square  of  the  wavelength 
(A,  Aproj),  respectively.  In  order  to  compress  the  range  (e.g.  source  distance  d  from  0  to  oo)  and  to 
obtain  useful  input  values  for  the  neural  network,  an  additional  scaling  by  an  exponential  function 
is  performed,  see  labeling  of  the  input  nodes  in  Fig.  4. 

Hence,  the  neural  network  requires  5  input  nodes.  We  use  4  output  nodes  in  order  to  represent 
the  different  computational  methods  available  in  FEKO:  MoM,  UTD,  PO,  and  IPO  [4]  or  Fock- 
currents  [9].  The  method  corressponding  to  the  highest  value  of  the  4  output  nodes  is  chosen.  For 
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IPO  and  Fock  currents  the  same  output  node  can  be  used  since  IPO  can  be  applied  only  to  fiat 
surfaces,  whereas  Fock-currents  are  used  on  convex  curved  elements. 

Investigations  with  SNNS  (Stuttgart  Neural  Network  Simulator  (10])  have  led  to  the  non-recursive 
network  topology  as  shown  in  Fig.  4  with  2  hidden  layers  and  10  nodes  per  layer.  The  nodes  of 
adjacent  layers  are  fully  meshed.  The  network  was  trained  by  the  standard  back-propagation 
and  the  resilient  back-propagation  algorithms  with  data  obtained  from  the  analysis  of  numerous 
radiation  and  scattering  problems.  We  created  two  different  sets  of  weighting  coefficients  for: 

•  high  accuracy,  long  run-times  can  be  tolerated 

•  moderate  accuracy  (e.g.  for  certain  EMC  investigations)  but  emphasis  on  low  memory  re¬ 
quirement  and  CPTJ-time 


surface  of  the  region:  1  —  e  so*5 

Aproi 

surface  of  the  projection:  1  —  e  so 
distance  to  the  source:  1  —  e-is* 
tallness  parameter:  s 
x 

curvature  measure:  1  —  e  100  e 


MoM 


PO 

IPO/Fock 

UTD 


Fig.  4:  Application  of  a  neural  network  to  the  automatic  selection  of  a  computational  method  for  a  region 
and  meaning  of  the  input  and  output  nodes. 
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5  Examples  for  the  automatic  selection  of  the  computational 
method 


The  first  example  is  depicted  in  Fig.  5:  The  monostatic  RCS  of  a  perfectly  conducting  cube  with 
an  edge  length  a  shall  be  determined  over  a  wide  frequency  range  so  that  0.1  <  j  <  7  with  the 
wavelength  A.  The  geometry  is  very  simple  and  the  6  faces  of  the  cube  represent  one  region  which 
is  uniformly  treated  by  either  MoM,  PO,  IPO  or  UTD.  It  should  be  noted  that  even  though  the 
single  faces  are  flat,  the  average  radius  of  curvature  g  of  the  whole  region  is  finite. 

For  accurate  RCS  computations  the  neural  network  trained  for  a  high  accuracy  should  be  used. 
It  proposes  the  MoM  up  to  the  limit  f  =  2.53  and  the  number  of  triangular  patches  increases 
continuously  from  24  for  j  =  0.1  up  to  5816  at  the  limit  |  =  2.53.  For  higher  frequencies, 
the  neural  network  suggests  to  use  the  IPO  and  the  preprocessor  automatically  generates  the 


Fig.  6:  Monostatic  RCS  a  of  the  cube  as  a  function  of  the  normalized  edge  length  a. 
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normalized  cube  size  | 

Fig.  7:  Memory  and  CPU-time  requirement  for  the  monostatic  RCS  computation  of  the  cube  for  (Pentium 
PC  266  MHz). 

corresponding  input  file  for  FEKO.  Even  for  higher  frequencies  the  IPO  remains  the  method 
being  chosen,  since  the  neural  network  was  trained  to  know  that  we  have  not  yet  implemented 
the  caustic  corrections  for  the  UTD  required  in  this  case  for  far-field  computations  with  a  plane 
incident  wave. 

In  Fig.  6  we  have  plotted  the  computed  RCS  values  and  compare  them  to  measurements  from  [11]. 
The  cost  in  memory  and  CPU-time  for  the  solution  is  depicted  in  Fig.  7. 

The  second  example  deals  with  a  monopole  antenna  of  height  h  =  83.28  mm  mounted  on  a  square 
plate  with  an  edge  length  of  a  -  1  m  as  shown  in  Fig.  8.  The  input  admittance  Y  of  the  monopole 
antenna  as  a  function  of  the  frequency  /  is  shown  in  Fig.  9,  where  three  methods  are  applied:  MoM 
alone  as  reference  solution.  MoM/PO  hybrid  method  (see  Fig.  8  for  regions),  and  the  MoM/UTD 
hybrid  method  with  MoM  for  the  wire  antenna  and  UTD  for  the  plate. 

Applying  the  neural  network  trained  for  high  accuracy  leads  to  an  application  of  the  MoM  up 
to  1123  MHz,  for  higher  frequencies  the  MoM/UTD  combination  is  proposed.  It  is  interesting 
to  note  the  behavior  of  the  neural  network  trained  for  moderate  accuracy  but  short  run-times 
and  low  memory  consumption:  The  MoM  is  selected  only  for  frequencies  lower  than  167  MHz 


Fig.  8:  Monopole  antenna  of  the  height  h  =  83.28  mm  with  a  wire  radius  g  =  0, 3331  mm  in  the  center  of 
a  square  metallic  plate  with  an  edge  length  of  a  =  1  m.  Shown  is  a  possible  arrangement  for  the 
combination  of  MoM  and  PO. 


460 


Fig.  9:  Real  and  imaginary  part  of  the  input  admittance  Y  of  the  monopole  antenna  on  a  metallic  plate 
as  a  function  of  the  frequency  /. 

corresponding  to  j  —  0.56.  In  the  frequency  range  167  MHz  <  f  <  313  MHz  (0.56  <  f  <  1-04) 
the  hybrid  combination  of  MoM  and  IPO  is  proposed,  and  for  higher  frequencies  the  neural 
network  decides  to  use  MoM/UTD.  Even  though  already  for  f  =  0.56  high  frequency  methods 
are  involved,  a  good  accuracy  can  be  obtained  as  demonstrated  in  Fig.  10.  Independent  of  the 
frequency,  a  solution  takes  less  than  2  sec  for  a  single  frequency  on  a  Pentium  PC  with  166  MHz. 


frequency  /  in  MHz 

Fig.  10:  Magnitude  of  the  input  admittance  Y  of  the  monopole  antenna  on  a  metallic  plate  as  a  function 
of  the  frequency  /. 
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6  Conclusions 


The  application  of  a  neural  network  to  automatically  select  suitable  numerical  methods  for  the 
analysis  of  electromagnetic  radiation  and  scattering  problems  has  been  discussed.  The  examples 
presented  are  based  on  the  computer  code  FEKO,  where  MoM,  PO  and  UTD  are  available. 
Two  examples  were  presented  which  clearly  show'  how  the  MoM  used  at  lower  frequencies  is 
automatically  replaced  by  the  asymptotic  methods  or  hybrid  combinations  at  higher  frequencies. 
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Introduction 

In  1982 1  wrote  an  article  [1]  on  umbrella  top  loaded  antennas,  based  on  an  extensive  experimental  model  study  of  base-fed 
antennas.  However,  in  that  article  I  described  a  method  which  could  be  used  to  feed  an  antenna  with  umbrella  wires 
suspended  from  a  grounded  tower,  by  feeding  the  tower  as  a  top-loaded  folded  unipole  —  and  this  method  of  feed  became  the 
title  of  the  article  (editor’s  choice).  Recently  an  interest  in  folded  umbrella  antennas  has  resurfaced  [Bob  Eldridge,  VE7BS, 
private  communications,  August  1998].  Apparently  some  of  VE7BS's  New  Zealand  friends  from  "down  under”  (VK3ZL 
and  VK6UZ)  have  landed  on  my  Ham  Radio  article  and  were  asking  him  questions  about  the  folded  version. 

This  paper  is  a  result  of  my  revisiting  umbrella  top-loaded  antennas  by  computer  simulation,  which  prompted  a  meticulous 
search  of  my  files.  This  search  brought  to  light,  in  my  view,  an  interesting  version  of  umbrella  antennas.  The  "interesting 
version"  of  this  type  of  antenna  was  devised  by  Carl  E.  Smith,  now,  in  1998  a  silent  key,  which  is  an  umbrella  antenna 
with  a  tuned  insulated  counterpoise  [2],  The  subject  of  elevated  radials  is  a  topic  that  has  interested  me  for  a  number  of 
years,  beginning  (insofar  as  published  papers  is  concerned)  with  my  paper  presented  in  the  Amateur  Radio  Session  of  the 
1996  ACES  Conference  [3], 

The  Umbrella  Antenna 

Electrically  short  tower  performance  can  be  improved  by  top  loading,  which  increases  the  radiation  resistance,  and  lowers 
the  base  reactance.  The  umbrella  antenna  is  one  method  to  top  load  a  tower,  without  the  need  to  install  additional  towers  to 
support  the  top  loading.  This  antenna  is  illustrated  in  Fig.  la.  The  top  loading  consists  of  a  number  of  active  umbrella 
wires,  connected  to  the  top  of  the  tower,  and  strung  obliquely  to  ground.  The  important  parameters  for  such  an  antenna  are 
the  height  h,  of  the  tower;  the  horizontal  distance,  d,  from  the  base  of  the  tower  to  the  extremities  of  the  insulated  guys 
supporting  the  conducting  umbrella  wires;  and  the  vertical  distance,  s,  from  the  top  of  the  tower  to  the  height  where  the 
umbrella  wires  are  broken  by  an  insulator. 


Fig.  1  Sketches  illustrating  phasing  of  currents  on  umbrella  antenna,  T-  and  L-type  radiators.  The  sketch  in  (a) 
illustrates  insulated  base  feed;  that  in  (d)  a  grounded  tower  folded  umbrella  antenna. 


The  sketches  in  Figs,  la,  lb  and  lc  illustrate  the  phasing  of  the  currents  on  various  top-loaded  vertical  antennas,  the 
umbrella  antenna,  the  T-,  and  the  inverted  L-type  antennas;  thus  illustrating  the  difference  between  these  types  of  radiators. 
In  the  case  of  the  T-  and  inverted  L-type  radiators,  the  currents  on  the  flat  top  and  on  the  vertical  part  of  the  antenna  system 
are  not  opposing,  since  these  currents  are  on  elements  that  are  orthogonally  oriented  (in  space).  Also,  recall,  considering 
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the  antenna  and  its  image  in  the  ground,  that  only  the  current  on  the  vertical  part(s)  of  the  antenna  contributes  appreciably 
to  radiation.  The  currents  on  the  flat-top  and  on  the  image  of  the  flat-top  in  the  ground  are  in  phase  opposition,  and  so 
essentially  cancel  insofar  as  radiation  is  concerned.  The  currents  on  the  vertical  part(s)  of  the  radiator  and  its  image  in  the 
ground  are  in  phase. 

In  the  case  of  the  umbrella  antenna,  however,  the  currents  on  the  umbrella  wires  have  a  vertical  component  that  is 
oppositely  directed  to  the  current  on  the  tower;  and  therefore  radiation  from  the  top  part  of  the  tower  over  the  distance  s  and 
from  the  umbrella  wires  partially  cancel. 

Thus,  while  the  capacitive  reactance  of  the  top  hat  increases,  and  the  base  reactance  decreases,  as  the  length  and  number  of 
umbrella  wires  increase;  the  radiation  resistance,  Rr,  first  increases,  and  then  decreases.  The  radiation  resistance,  in  accord 
with  our  experimental  model  study,  is  a  maximum  when  s/h  =  0.43. 

The  Folded  Umbrella  Antenna 

The  sketch  in  Fig.  ID  illustrates  an  umbrella  antenna  employing  a  grounded  tower,  which  is  configured  as  a  folded 
element.  This  arrangement  can  be  designed  to  raise  the  impedance  of  the  antenna  to  the  value  required  by  the  feeder.  The 
latter  may  then  be  connected  directly  to  the  antenna,  if  the  length  of  the  umbrella  wires  are  adjusted  for  resonance,  without 
the  need  for  a  matching  unit  The  tower  is  grounded  at  its  base,  and  a  cage  of  wires  surround  the  tower,  connected  to  the 
top  of  the  tower,  and  insulated  at  the  bottom.  The  feeder  is  connected  directly  to  the  bottom  of  this  cage  of  wires,  to  a  skirt 
wire  that  joins  the  ends  of  this  cage  of  wires. 

Nolan  [4]-has  used  such  an  antenna  system  for  MF  broadcasting,  but  he  provides  no  design  information,  other  than  that  the 
height  of  the  tower  can  be  short  (0.1  X),  and  if  4-wires  are  used  for  the  cage,  the  resonant  antenna  can  be  connected  directly 
to  the  feeder.  For  resonance,  the  capacitance  of  the  umbrella  top  hat  must  tune  with  the  inductance  of  the  mast  and  cage  of 
wires  in  parallel.  This  will  require  slightly  more  top  loading  than  for  the  base  fed  antenna,  but  how  much? 

The  Smith  Tuned  Counterpoise  Umbrella  Antenna 

Smith,  in  the  reference  cited  above  describes  an  electrically  short  antenna  system  which  employs  a  tuned  elevated 
electrically  short  radial  wire  ground  system,  see  Fig.  2.  This  is  an  interesting  antenna,  not  only  (from  my  point  of  view) 
because  it  employs  (tuned)  elevated  radials,  but,  notice  the  way  in  which  he  has  configured  the  tower.  The  effect  of  the  top 
loading  can  be  enhanced  by  adding  a  low  loss  inductance  at  the  top  of  the  tower.  This  low  loss  inductance  at  the  top  of  the 
tower  Is  achieved  by  an  insulated  conductor  inside  the  tower,  and  shorting  it  to  the  tower  to  simulate  a  short  circuited 
coaxial  line  with  an  open  end  and  inner  conductor  connected  to  the  top  loading  hat  at  the  top  of  the  tower.  I  must  say  that 
this  is  rather  clever  —  I  had  previously  not  seen  such  an  arrangement.  Can  NEC-4D  (I  use  the  EZNEC  pro  4D  version) 
model  such  an  antenna  system?  We  will  address  this  below. 


Fig.  2  Top  loaded  monopole  with  insulated  counterpoise  and  buried  radial  ground  system  (after  Smith  [2]). 

The  current  distribution  on  the  top  hat  conductors  can  be  assumed  to  be  sinusoidal,  with  zero  current  at  the  outer  ends  and  a 
maximum  cuirent  at  the  top  of  the  tower  where  they  are  joined  together.  At  this  point  the  top  loading  inductance,  or  open 
end  of  a  short  circuited  coaxial  line,  in  series  can  replace  an  appreciable  portion  of  the  sine  wave,  with  the  result  that  the 
current  distribution  on  the  tower  can  be  essentially  constant. 
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At  the  bottom  of  the  tower  more  series  inductance  can  be  added  by  insulating  a  conductor  up  inside  the  tower  to  a  shorted 
point. 

The  Smith  Tuned  Counterpoise  Umbrella  Antenna  is  Modelled 

Smith  gives  some  detail  on  a  version  of  his  umbrella  top  loaded  antenna  with  a  tuned  elevated  counterpoise  in  use  at 
Kodiak,  Alaska,  station  KVOK  operating  on  560  kHz.  This  station  employs  a  tuned  insulated  counterpoise.  The  antenna 
is  a  vertical  radiator,  see  Fig.  3,  mast  height  43.5  m  (143  feet),  0.08  X;  with  umbrella  top-loading.  16  drooping  radials, 
length  38.1  m  (125  feet),  0.071  X  and  s/h  =  0.5;  fed  against  16  insulated  radial  wires,  height  3  m,  length  45.7  m  (125  feet). 
0.085  X.  The  counterpoise  is  connected  to  the  antenna  system  coupling  unit  and  to  ground  through  a  counterpoise  series 
reactance  (reference  Fig.  2)  which  is  tuned  to  maximize  the  field  strength  radiated  in  the  far  field.  Station  KVOK  employs 
a  simple  unifoim  tower. 


Fig.  3  Vertical  plan  sketch  of  tower  top 
loading  and  counterpoise  for  station  KVOK. 


Smith  describes  his  view  with  respect  to  minimizing  ground  loss.  The  E-field  between  the  top  hat  and  the  counterpoise  can 
be  likened  to  a  transmission  line  that  is  open  at  one  end  and  shorted  at  the  other.  There  is  an  E-field  between  the  top  hat 
and  the  counteipoise,  and  some  of  the  E-field  passes  through  to  the  ground  beneath.  The  counterpoise  potential  is 
maintained  so  as  to  minimize  the  current  in  the  lossy  ground.  The  counterpoise  series  reactance  (  +j  X4  in  Fig.  2,  +j  Xc 
in  our  studies).  His  antenna  employed  an  "abbreviated"  buried  radial  wire  ground  system  (but  details  were  not  given).  I 
assume,  for  our  model,  a  6  m  stake  ground. 

First,  I  have  computed  the  antenna’s  self  impedance.  For  this  umbrella  top  loaded  antenna  fed  against  a  PEC  ground, 
Za  -  6.8  +  j  145.5  ohms.  The  unattenuated  field  strength  at  1  km  Eu  =  256  mVm  for  a  transmitter  power  of  1  kW.  The 
fact  that  the  antenna  is  so  heavily  top  loaded  that  its  base  reactance  is  inductive  was  not  commented  on  by  -Smith 

With  the  16  elevated  resonant  radials  over  average  ground,  we  compute  Za  =  7.1  +  j  130.3  ohms;  and  Eu  =  255  mV/m. 
The  field  strength  Eu  (for  the  case  of  real  ground)  is  the  unattenuated  field  strength,  FS,  at  1  km  for  a  transmitter  power  of 
1  kW,  which  for  results  given  here  is  based  on  the  FS  at  200  metres,  attenuated  by  1/d  to  d  =  1000  m. 

Smith  indicates  that  the  proper  method  of  tuning  his  counterpoise  system  is  to  observe  the  far  field  zone  field  strength 
while  tuning  the  system.  Indeed,  see  Fig.  4,  dramatic  variations  in  field  strength  should  be  observed  as  the  counteipoise 
is  tuned,  which  is  in  accord  with  observation.  As  the  reactance  of  this  counterpoise  series  inductance  is  increased,  the  base 
impedance  of  the  antenna  and  the  field  strength  change  rapidly  with  increasing  reactance,  for  Xc  <  j  400  ohms;  after  which 
the  changes  with  increasing  reactance  are  rather  small,  asymptotically  approaching  limiting  values  for  a  rather  high 
reactance  value.  When  Xc  =  j  1000  ohms,  Z&  =  9  +  j  78  ohms;  and  Eu  =  234  mV/m  (a  coil  Q-factor  of  300  was  assumed). 
Smith  measured  an  antenna  resistance  of  7.2  ohms  for  his  "tuned”  antenna. 

Therefore,  the  radiation  efficiency  (according  to  NEC-4D)  for  this  tuned  counterpoise  antenna  over  real  (average)  ground 
il  =  83  percent.  This  radiation  efficiency  estimation  is  based  on  the  relation  that  the  radiated  power,  Pr,  varies  as  Eu^. 
Smith's  estimate  for  the  radiation  efficiency  is  92-percent  (based  on  his  value  for  Rr  =  6.6  ohms,  and  his  measured  antenna 
resistance  Ra  =  7.2  ohms). 

Whatever,  I  concur  with  the  conclusion  of  James  Corum  of  Corum  Associates,  Inc.  (see  reference  [2]),  that  the  Smith  low 
loss  tuned  counteipoise  structure  is  a  remarkable  addition  to  the  technology  of  electrically  small  antennas. 
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Fig.  4  Calculated  change  in 
unattenuated  field  strength  at  1  km 
(Eu)  and  antenna  system  base 
resistance  (Ra),  versus  counterpoise 
series  reactance  (+ j  Xc). 


Design  Examples  for  160M  Antennas 


In  what  follows  I  describe  modelling  of  interest  for  the  radio  amateur,  that  is  umbrella  antennas  for  the  160M  band.  I  have  ■ 
not  tried  to  optimize  parameters,  that  is  reserved  for  a  follow  on  study  for  publication  in  the  amateur  radio  literature. 


The  Smith  Tower  Arrangement 


First,  I  model  a  conventional  base  fed  tower  with  umbrella  top  loading  over  a  PEC  ground.  I  will  address  later  the  tuned 
insulated  counter  poise.  For  a  15.3  m  tower  (diameter  243  cm),  base  height  1  m;  with  8  umbrella  wires  and  s/h  =  0.43, 
d/b  =  1.4  —  NEC-4D  tells  us  that  the  antenna  is  resonate  at  1.95  MHz.  The  base  impedance  Za  =  1 1.1  ohms. 

Next  I  configure  the  tower  as  devised  by  Smith,  see  Fig.  5,  in  which  the  open  ends  of  a  short  circuited  coaxial  line  are 
employed  to  place  a  low  loss  inductor  at  the  top,  and  at  the  bottom  of  the  tower.  I  have  arbitrarily  placed  the  short  circuits 
at  1/3  h  and  2/3  h.  The  antenna  system's  base  impedance  (according  to  NEC-4D)  Za  (1.95  MHz)  =  10.4  +  j  97.2  ohms. 
As  expected  the  resonant  frequency  of  the  antenna  system  is  lowered  significantly,  to  1.59  MHz,  and  Za  =  6.1  ohms.  It 
seems  that  indeed  NEC-4D  does  model  this  antenna,  but  I  have  no  experimental  data  with  which  to  compare. 


In  my  1982  Ham  Radio  article  I  considered  a  design  example  for  a  160M  antenna,  a  frequency  of  1.815  MHz.  I  chose 
s/h  =  0.43  (which  is  the  value  for  a  maximum  in  the  radiation  resistance  Rr),  eight  umbrella  wires,  and  d/h  =  1.4.  This 
base-fed  antenna  should  be  resonant  (according  to  my  experimental  study)  when  the  tower  height  is  16.5  metres;  and  the 
radiation  resistance  Rr  =  7  ohms. 
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As  a  result  of  the  recent  interest  in  umbrella  antennas  (referenced  above),  I  have  numerically  analyzed  this  design  example. 
According  to  NEC-4D  this  antenna  should  be  resonant  at  1.84  MHz  (I  had  estimated  resonance  based  on  our  experimental 
model  studies  to  be  1.815  MHz);  and  the  antenna’s  impedance,  Za,  (no  ground  loss)  is  11.6  ohms. 

Folded  Umbrella 

When  I  wrote  the  article  on  umbrella  antennas  I  had  done  a  lot  of  experimental  modelling  of  such  antennas,  base  fed.  I  got 
caught  up  on  the  idea  of  feeding  the  tower  as  a  top-loaded  folded  unipole,  having  come  across  the  article  by  Nolan.  I  noted 
that  for  practical  use  the  antenna  system  should  be  made  self  resonant,  that  is  the  capacitance  of  the  umbrella  top-loading 
must  tune  with  the  inductance  of  the  mast  and  the  cage  in  parallel,  and  that  this  would  require  more  top  loading  than  for  the 
base-fed  antenna  —  but  I  had  no  measured  data.  If  I  feed  the  umbrella  antenna  dimensioned  as  above,  as  a  folded-unipole 
(4- wires  in  the  cage,  distance  from  the  tower  to  the  ends  of  the  cage  wires  1 -metre,  height  above  ground  1  m),  the  resonant 
frequency  is  indeed  higher,  2.122  MHz  according  to  NEC-4D;  and  the  input  impedance  equals  46.5  ohms  (assuming  no 
ground  resistance  loss). 

Clearly  more  top  loading  or  a  larger  antenna  system  (taller  tower)  is  needed  for  a  160M  antenna.  According  to  NEC-4D  the 
resonant  height  for  the  folded  umbrella  antenna,  having  parameters  as  above,  h  =  18.5  metres;  and,  assuming  a  ground  loss 
resistance  of  5-ohms  the  input  impedance  is  50  ohms. 

This  calculation  is  for  the  antenna  parameters  as  given  in  my  1982  HR  article:  i.e.  s/h  =  0.43;  d/h  =  1.4,  the  number  of  top 
loading  umbrella  wires  n  =  8.  For  h  =  18.5  m,  f0  =  1.87  MHz. 

The  2:1  50-ohm  SWR  bandwidth  is  1.815  to  1.92  MHz:  SWR  increases  more  rapidly  above  resonance  than  below.  Above 
resonance  the  antenna  is  inductive,  and  so  it  could  be  resonated  by  a  capacitor. 

Tuned  Counterpoise  Modelled 

For  the  interest  of  the  radio  amateur,  I  have  dimensioned  an  antenna  similar  to  the  one  Smith  describes  for  the  160M  band 
(frequency  1.9  MHz).  I  have  not  tried  to  optimize  parameters.  Instead  I  have  dimensioned  an  antenna  which  seems  to  me 
to  be  practical.  Sketch  it  out  and  you  will  see.  I  have  assumed  that  the  radiator  is  a  pipe  mast  5.08  cm  diameter,  having  a 
(practical)  length  of  10  m  (0.0.063  X);  s/h  =  0.5;  and  d/h  =  2.  The  base  of  the  mast  is  2.5  m  above  the  ground  —  so  that 
one  can  walk  beneath  the  radial  wire  counterpoise  system.  The  length  of  the  counterpoise  radial  wires  is  20  m  (0.126  A,). 
That  is  the  non-conducting  support  guys  for  the  top-hat  conductors  connect  to  the  2.5  m  high  posts  supporting  the  radial 
wires  of  the  counterpoise.  "Ground"  is  average  ground  (a  =  5  mS/m,  e  =  13). 

First,  for  reference  purposes,  we  determine  the  number  of  umbrella  wires  needed  to  resonant  this  antenna,  ground  based, 
PEC  ground.  For  n  equal  to  8,  16,  and  24-umbrella  wires,  the  antenna's  impedance  Za  equals  3.4  -  j  60.2  ohms  (and 
Eu  =  281.5  mV/m),  3.3  +  j  1.6  ohms  and  3.2  +  j  25.2  ohms  respectively. 

For  our  example  antenna  we  chose  n  =  8,  see  Fig.  6,  since  for  the  radio  amateur  this  is  practical  number  of  umbrella  wires 
—  hence  our  example  antenna  fed  against  the  tuned  counterpoise  wires  is  not  resonant.  As  the  reactance  of  the  tuning  coil 
(+  j  Xc  in  our  study,  +  j  X4  in  Smith's  antenna  system)  is  increased,  the  field  strength,  Eu,  first  decreases,  then  increases 
rapidly  at  First,  and  finally  more  slowly,  asymptotically  approaching  some  final  value.  For  j  Xc  =  j  1000  ohms,  Q  =  300, 
Eu  =  221 .4  mV/m,  which  corresponds  to  a  radiation  efficiency  (assuming  no  loss  in  the  antenna  matching  circuit) 
ti  =  62  percent  —  which  again  in  our  view  is  rather  remarkable  for  such  an  electrically  short  antenna.  The  antenna  system 
impedance  Zas  =  4.5  -  j  127  ohms.  The  space  wave  pattern  is  entirely  vertically  polarized  (gain  G  =  -  0.6  dBi). 

This  umbrella  antenna,  fed  against  8  resonant  elevated  radials,  length  37.55  m,  height  2.5  m,  would  have  an  impedance 
Za  =  3.5  -  j  72  ohms,  and  Eu  =  248.3  mV/m,  r\  =  78  percent.  But  radials  having  a  length  of  37.55  m  (123.2  feet)  will  not 
fit  in  a  typical  back  yard. 

Certainly  as  remarked  in  the  discussion  above,  and  in  the  discussion  following  Smith's  paper,  the  antenna  system  devised 
by  him  has  a  remarkable  efficiency  for  such  a  small  antenna,  with  an  abbreviated  ground  system. 
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Fig.  6  Modelling  a  160  M  umbrella  antenna  with  tuned  elevated  radials. 

Concluding  Remarks 

Tbe  umbrella  antenna  with  a  tuned  radial  wire  counterpoise  makes  a  practical  low  loss  160M  antenna,  which  will  fit  in  a 
typical  back  yard.  While  clearly  the  counterpoise  has  to  be  tuned,  for  maximum  FS,  "tuning"  does  not  resonate  the 
antenna.  As  the  reactance  of  this  series  reactance  +  jXc  is  increased,  the  FS  approaches  a  maximum  value,  and  the 
antenna's  resistance  approaches  the  value  it  would  have  for  a  PEC  ground  (or  the  value  it  would  have  for  the  case  where 
resonant  radials  were  employed).  But  the  antenna's  reactance  is  increased  over  the  reactance  it  would  have  over  a  PEC 
ground. 

In  my  view,  this  study  of  an  umbrella  antenna  fed  against  a  tuned  radial  wire  ground  system,  should  be  of  interest  to 
amateurs  in  radio  desiring  a  practical  antenna  for  the  160M  band  —  an  alternative  to  burying  a  large  number  of  radial  wires. 
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The  tools  available  for,  and  the  practice  of,  computational  electromagnetics  (CEM)  have 
changed  unimaginably  since  I  first  became  involved  in  the  field  in  the  late  1950s.  At  that  time 
the  computer  group  at  the  Radiation  Laboratory  of  the  University  of  Michigan,  which  I  joined  as 
an  Assistant  Research  Engineer,  consisted  of  a  team  of  female  research  assistants  who  operated 
electromechanical  calculating  machines.  The  mathematician  group  leader  served  as  the 
programmer,  designing  both  the  algorithms  and  data-flow  sheets.  Each  calculator  operator 
performed  designated  calculations,  recorded  the  results,  and  passed  the  data  sheet  to  the  next 
person  in  line.  Although  not  very  fast,  operating  at  a  "FLOP"  rate  of  perhaps  2-4/minute, 
significant  computations  were  done,  including  developing  tables  of  various  classical  functions. 
The  first  Rad  Lab  digital  computer  was  the  IBM  704,  which  was  programmed  with  patch  panels, 
and  the  mainframe  computer  at  the  University  on  which  I  learned  to  program  to  complete  my 
PhD  research  in  1965  was  the  IBM  7094. 

The  hardware  tools  available  for  CEM  have  increased  in  speed  and  storage  by  orders  of 
magnitude  in  capability,  from  the  10A5  FLOP  rate  of  the  7094  to  rates  now  on  the  order  of  10A12 
FLOPS.  Memory  has  similarly  increased  from  32k  of  36-bit  words  of  the  7094  to  GBytes  now. 
Input/output  storage  has  changed  from  punched  paper  tape,  punched  cards  and  teletype  terminals 
running  at  rates  of  a  few  Bytes/sec  to  magnetic  and  optical  disks  having  transfer  rates  of  10s  of 
MBytes/sec.  Display  systems  that  consisted  of  printer  carriage  plots  and  pen  plotters  have  been 
replaced  by  color  CRT  monitors  that  seem  likely  themselves  to  be  displaced  by  LCD  displays, 
and  color  inkjet,  laser  and  thermal-transfer  printers.  Finally,  desktop  computers  have  brought, 
what  was  just  a  short  time  ago  mainframe  power,  to  the  direct  control  of  the  user. 

Changes  in  the  software  tools  used  in  CEM  have  proceeded  apace.  Frequency-domain 
integral-equation  (FDIE)  models  made  an  appearance  in  the  early  to  mid  1960s,  followed  shortly 
thereafter  by  the  time-domain  differential-equation  (TDDE  or  FDTD).  TDIE  and  FDDE  models 
arrived  towards  the  end  of  the  decade.  Problem  sizes  of  the  early  1960s  were  typically  a  few  10s 
of  unknowns  for  FDIE  models  being  initially  limited  to  simple  objects  and  shapes  like  wires  and 
spheres.  Problems  now  routinely  modeled  using  IEs  can  involve  100s  of  thousands,  and  those 
using  FDTD  can  have  10s  of  millions,  of  spatial  unknowns.  Problem  complexity,  including  both 
geometrical  and  electrical,  has  advanced  similarly.  Recent  attention  has  shifted  towards  so- 
called  "fast"  techniques  whereby  the  reduced  complexity  of  farther  interactions  is  exploited  to 
reduce  the  impact  of  frequency-scaling  laws,  reduced-order  modeling  using  techniques  like 
model-based  parameter  estimation,  and  hybrid  techniques  that  seek  to  combine  the  best  features 
of  the  individual  approaches.  Progress  has  also  been  made  in  input/output  software,  though  this 
remains  the  weak  link  in  most  applications  in  terms  of  the  overall  cost  of  computer  modeling. 

This  presentation  will  be  a  selective  review  of  CEM  over  the  past  40  years  from  my  own 
experience  and  perspective.  I'll  highlight  what  I  consider  to  be  some  of  the  more  significant 
developments  in  both  hardware  and  software  as  they  have  affected  CEM  capabilities.  To 
conclude,  I'll  try  to  anticipate  what  the  future  might  hold  over  the  next  40  years. 
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Abstract 

Coupling  characteristics  of  cylindrical  microstrip  antennas  have  been  extensively  investigated. 
The  mutual  coupling  coefficients  between  two  cylindrical  microstrip  antennas  with  rectan^lar, 
triangular,  and  circular  patches  have  been  analyzed  theoretically  and  experimentally.  Different 
theoretical  approaches  using  full-wave  analysis,  cavity-model  method,  and  generalized 
transmission- line  model  (GTLM)  theory  are  described  in  this  article.  Comparison  of  the  measured 
data  with  theoretical  results  is  also  given. 


1.  Full-wave  Solution  of  Mutual  Coupling  Coefficient 

The  mutual  coupling  characteristics  between  two  cylindrical  rectangular  or  triangular 
microstrip  antennas  have  been  analyzed  using  a  full-wave  approach  and  a  moment-method 
calculation  [1-3].  Figs.  1(a)  and  1(b)  show  the  configurations  of  two  cylindrical  rectangular  and 
triangular  microstrip  antennas,  respectively.  The  two  probe-fed  microstnp  antennas  are  first  treated 
as  a  two-port  network  with  a  2  x  2  port  matrix  [Zp]  as  shown  in  Fig.  2.  The  relation  between  the 
port  voltages  and  currents  are  defined  as 
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where  the  superscript  p  denotes  the  port  quantities.  To  determine  the  port  impedance  matrix,  the 
input  impedance  Zf,  of  the  excited  antenna  with  the  presence  of  the  other  antenna  open-circuited 
and  the  mutual  impedance  Zf,  between  the  two  antennas  need  to  be  calculated.  It  should  be  noted 
that  since  the  measurements  of  voltages  and  currents  at  microwave  frequencies  are  difficult 
scattering  matrix  related  to  the  direct  measurements  of  incident,  reflected,  and  hansmifledwOTes  is 
used  to  completely  describe  the  multiport  network  problem.  For  a  two-port  network  shown  in  Fig. 
2,  the  scattering  matrix  [S]  can  be  evaluated  from 

[S]  =  ([Zp]-Zo[U])([Zp]+Zc[U])-',  (2) 


where  [U]  is  a  unit  matrix  of  order  2  x  2  and  Zc  is  the  characteristic  impedance  of  the  feeding  coax 
(can  be  assumed  to  be  50  £2  here).  As  to  solve  Zf,  and  ZJ,,  the  following  formulas  are  applied: 
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Zfl  =  -l,E(‘,Jp1>dv> 

Pi 

Z5,=-fEm-J<2)dv,  (4) 

where  E(I)  is  the  total  electric  field  inside  the  substrate  layer  due  to  the  probe  current  at  port  1,  and 
vpi  is  the  volume  over  the  probe  at  port  i.  With  Z\x  and  Zv2l  determined,  the  mutual  coupling 
coefficient  can  be  calculated  from  Eq.  (2)  and  given  as 


s12 =2Z5,  •z./KZf,  +ZC)2  -(Z5,)2].  (5) 

From  the  above  formulation,  the  mutual  coupling  characteristics  have  been  analyzed.  Typical 
results  versus  edge  spacing  S,  the  distance  between  edges  of  two  patches,  for  the  E-plane  and  H- 
plane  coupling  cases  are  presented  in  Figs.  3  and  4.  In  the  study  the  input  impedance  of  the  isolated 
microstrip  antenna  with  various  cylinder  radii  of  interest  is  all  adjusted  to  be  500  at  the  TMoi  mode 
excitation  (rectangular  patches)  or  the  TMi0  mode  excitation  (triangular  patches).  The  obtained  Z\{ 
is  found  to  vary  slightly  with  500,  and  the  value  of  Z\x  is  strongly  affected  by  the  cylinder  radius 
and  edge  spacing.  It  is  observed  that  for  a  large  edge  spacing,  the  coupling  decreases  more  rapidly 
with  increasing  cylinder  radius.  Different  curvature  effects  on  the  E-  and  H-plane  mutual  coupling 
are  observed.  Good  agreement  between  experiment  and  theory  is  also  seen. 

2.  Cavity-model  Solution  of  Mutual  Coupling  Coefficient 

Theoretical  formulation  using  a  cavity-model  method  has  also  been  applied  for  the  study  of 
mutual  coupling  between  cylindrical  microstrip  antennas  with  rectangular  and  circular  patches  [4, 
5).  Expressions  of  the  mutual  impedance  and  mutual  coupling  coefficient  have  been  derived,  and 
typical  numerical  results  for  the  E-plane  and  H-plane  coupling  cases  have  been  reported.  To  adopt 
the  cavity-model  theory,  the  thin-substrate  condition  is  assumed  and  the  substrate  thickness  is  also 
assumed  to  be  much  smaller  than  the  cylinder  radius  and  the  operating  wavelength.  And  owing  to 
the  thin-substrate  assumption,  the  surface-wave  effect  on  the  mutual  coupling  can  be  ignored.  It  is 
also  assumed  that  the  mutual  interaction  does  not  disturb  the  interior  field  distribution  inside  the 
cavity  below  the  microstrip  patches.  Based  on  these  assumptions,  the  mutual  impedance  between 
the  two  antennas  can  be  calculated  from 

(6) 

hl2 

where  Ij  and  I2  are  the  feed  currents  of  the  two  antennas,  Hj  is  the  magnetic  field  set  up  by  antenna 
1  on  antenna  2,  and  M2  is  the  equivalent  magnetic  current  of  antenna  2.  Based  on  the  cavity-model 
analysis,  the  mutual  impedance  for  the  E-plane  and  H-plane  coupling  cases  can  be  derived  as 
follows: 
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where  the  parameters  have  been  given  in  [4,  5]. 

With  the  mutual  impedance  obtained,  the  mutual  coupling  coefficient  S12  for  the  E-  and  H- 
plane  coupling  cases  can  then  be  calculated  from  Eq.  (5).  And  due  to  the  assumption  that  the  mutual 
interaction  does  not  disturb  the  interior  field  distribution  of  the  antenna,  the  self-impedance  is  here 
equal  to  the  input  impedance  of  the  isolated  antenna.  Thus,  in  this  case,  Eq.  (5)  can  be  reduced  to  be 
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Numerical  results  calculated  from  Eq.  (8)  have  also  been  analyzed.  Results  show  that  the 
obtained  cavity-model  solutions  of  mutual  coupling  coefficients  agree  well  with  the  full-wave 
solutions.  And  since  the  computation  of  one  cavity-model  solution  of  S12  requires  less  time  than 
that  of  the  full-wave  solution,  this  makes  the  cavity-model  analysis  useful  for  antenna  engineers  in 
related  practical  designs. 


3.  GTLM  Solution  of  Mutual  Coupling  Coefficient 


Mutual  coupling  between  two  cylindrical  rectangular  or  circular  microstrip  antennas  has  also 
been  studied  using  the  GTLM  theory  [6,  7].  As  referring  to  the  geometry  in  Figs.  1(a)  and  1(c)  and 
considering  the  patches  to  be  excited  at  their  respective  fundamental  resonant  modes,  the  two 
rectangular  or  circular  microstrip  antennas  can  be  represented  by  the  equivalent  circuits  shown  in 
Figs.  5  and  6,  respectively.  In  Fig.  5  the  two  rectangular  patches  are  modeled  as  sections  of 
transmission  lines  and  are  replaced  by  equivalent  iz  networks  of  gl5  g2,  g3  and  gj,  g2,  g3.  The 
circuit  element  ys  -  ym  is  the  total  wall  admittance  at  the  radiating  edges;  ys  denotes  the  self¬ 
admittance  and  ym  denotes  the  mutual  admittance.  To  solve  the  equivalent  circuit,  we  decompose 
the  circuit  into  three  cascade  connections  of  two-port  networks.  The  ABCD  matrix  for  a  two-port 
network  is  used  in  the  analysis  of  the  cascade  networks  and  we  have  [6] 
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The  relation  between  ABCD  and  Z  parameters  can  be  obtained  from 


'Z|j  Z12]_rA/C  (AD-BC)/C 
_Z2l  zJ-Ll/C  D/C 
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With  the  [Z]  matrix  determined,  the  mutual  coupling  coefficient  can  be  calculated  from  Eq.  (5)  with 
Zc  set  to  50Q.  As  for  the  equivalent  circuit  shown  in  Fig.  6  for  the  case  of  two  circular  patch 
antennas,  an  additional  mutual  admittance,  y^,  between  the  radiating  edges  of  the  two  circular 
patches  is  introduced  and  the  equivalent  circuit  can  be  simplified  to  be 
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where  Ij  and  I2  are  the  probe  input  currents,  and  V,  and  V2  are  the  probe  excitation  voltages; 
expressions  of  the  elements  in  the  [Z]  matrix  can  be  found  in  [7].  With  the  [Z]  matrix  obtained,  the 
mutual  coupling  coefficient  is  also  readily  evaluated  from  Eq.  (5). 

Typical  mutual  coupling  coefficients  versus  edge  spacing  for  the  E-  and  H-plane  coupling 
cases  are  shown  in  Fig.  7.  From  the  results  obtained,  good  agreement  between  the  measured  data 
and  calculated  GTLM  solutions  is  observed,  and  the  dependence  of  the  mutual  coupling  on  the 
cylinder  radius  is  observed  to  be  the  same  as  that  obtained  using  the  cavity-model  and  full-wave 
solutions. 

4.  Conclusions 

Various  theoretical  models  such  as  full-wave  formulation,  cavity-model  analysis,  and 
GTLM  theory  have  been  successfully  applied  in  the  study  of  mutual  coupling  characteristics  for  two 
microstrip  antennas  mounted  on  a  cylindrical  surface.  The  theoretical  results  obtained  for  the 
mutual  coupling  coefficients  are  in  good  agreement  with  the  experimental  results.  More  calculated 
and  measured  results  will  be  given  and  discussed  in  the  presentation. 
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(a)  (b) 


Figure  3  Calculated  and  measured  mutual  coupling  coefficients  for  cylindrical  rectangular 
patch  antennas;  h  =  0.762  mm,  =  2.98, 2L  =  6  cm,  2b<j>0  =  4  mm. 

(a)  E-plane  coupling  case,  (b)  H-plane  coupling  case. 
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Figure  4  Calculated  and  measured  mutual  coupling  coefficients  for  cylindrical 
triangular  patch  antennas;  h  =  0.762  mm,  s,  =  3.0,  d  =  6  cm. 

(a)  E-plane  coupling  case,  (b)  H-plane  coupling  case. 
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Figure  6  Equivalent  circuit  of  two  cylindrical  circular  microstrip  antennas  shown  in  Fig.  1(c). 


Figure  7  Calculated  and  measured  mutual  coupling  coefficients  for  cylindrical  circular 
patch  antennas;  h  =  0.762  mm,  S!  =  3.0,  rd  —  3.2  cm. 

(a)  E-plane  coupling  case,  (b)  H-plane  coupling  case. 
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ABSTRACT 

The  performance  of  a  dielectrically  loaded 
quadrifilar  antenna  is  assessed  using  a  brute- 
force  (Transmission  Line  Matrix)  TLM  simulation. 
The  geometry  and  operation  is  described.  The 
approach  to  casting  a  realistic  model  that  is  able 
to  advance  the  design  is  discussed  with  a  critical 
appraisal  of  the  predictive  accuracy.  The 
proposed  antenna  falls  into  the  category  of  small 
antennas  and  the  simulations  account  for  the  high 
field  concentration  in  the  dielectric  as  well  as  the 
intricacies  in  the  input  impedance  and  feed 
mechanisms. 


t.  INTRODUCTION 

The  paper  discusses  the  TLM  simulation  of  a 
novel  dielectric  loaded  quadrifilar  helix  antenna 
topology.  This  antenna  has  been  the  subject  of 
considerable  research  using  electromagnetic 
modelling  and  empirical  techniques.  Leisten  et  al 
[1,2]  has  previously  published  results  from  a 
study  of  miniaturised  quadrifilar  helix  antennas 
employing  ceramic  dielectric  materials  to 
dielectrically  foreshorten  the  resonant 
dimensions.  From  a  computational  electro¬ 
magnetics  point  of  view  this  has  proven  to  be  a 
complex  structure  to  simulate.  The  paper  will 
demonstrate  how  measures  to  simplify  the  model 
to  achieve  computational  efficiency  have 
significantly  reduced  the  predictive  accuracy  of 
the  model.  More  realistic  models  that  are  able  to 
advance  the  design  is  described  with  a  critical 
appraisal  of  the  predictive  accuracy. 

The  antenna  is  formed  as  a  metal  circuit  pattern 
that  is  printed  on  the  surface  of  a  cylindrical 
ceramic  dielectric  core.  It  offers  advantages  over 
conventional  antennas  with  regard  to  the  isolation 
of  the  currents  on  the  antenna  element  from 
those  on  the  hand-set  ground-plane  and  near 
field  detuning.  It  is  targeted  to  be  used  in  GPS 


instruments  that  are  small  enough  to  be  used  in 
cell  phone  handsets  and  also  in  Personal 
Communications  Satellite  services. 


2.  GEOMETRY  DEFINITION 

Although  there  are  now  many  forms  of  quadrifilar 
antennas  genericaliy  they  all  employ  two 
quadrature  phased  bifilar  helical  loops  which  are 
placed  at  right  angles  to  each  other.  The  structure 
of  the  dielectric  loaded  quadrifilar  helix  antenna  is 
shown  in  Figure  1.1.  This  antenna  achieves 
quadrature  phasing  by  means  of  using  two  bifilar 
helices  which  have  slightly  offset  resonant 
frequencies.  The  elements  of  bifilar  helix  1  are 
adjusted  longer  than  resonance  to  produce  an 
input  impedance  with  a  phase  angle  of  +45°  and 
the  elements  of  bifilar  helix  2  are  adjusted  shorter 
than  resonance  to  produce  an  input  impedance 
with  a  phase  angle  of  -45°.  This  strategy 


Figure  1.1:  The  structure  of  the  dielectric 
loaded  quadrifilar  helix  antenna 
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produces  the  90°shift  in  phase  between  the 
currents  in  bifilar  helix  1  with  respect  to  bifilar 
helix  2. 

In  common  with  this  configuration  the  structure 
employs  balanced  feed  at  the  top  and 
differentially  phased  lines  to  organise  orthogonal 
current  phase  in  ascending  angle  anti-clockwise 
around  the  periphery  of  the  antenna.  It 
incorporates  a  sleeve  balun  in  the  lower  portion 
that  projects  a  balanced  signal  feed  to  the  back¬ 
fire  tip  of  the  antenna.  The  size  of  the  antenna  is 
significantly  reduced  with  respect  to  its  free  space 
counterpart  due  to  the  dielectric  loading  that  is 
afforded  by  the  high  permittivity  (sr«30-90)  of  the 
dielectric  core. 

The  radiating  section  of  the  quadrifilar  antenna 
can  be  viewed  as  being  composed  of  two 
orthogonally  disposed  loops  of  nominal  length 
360°.  In  each  loop  current  passes  from  one  arm 
of  a  balanced  input  to  flow  around  about  a  path 
consisting  of  a  helical  metal  conductor  around  the 
rim  of  the  balun  or  trap  and  finally  in  the 
diametrically  opposite  helical  metal  conductor  to 
the  opposite  balanced  member  of  the  feed 
system. 


3.  THE  ELECTROMAGNETIC  MODEL 

The  main  modelling  tool  that  was  used  to  model 
the  crown-rimmed  antenna  was  a  commercial 
TLM  electromagnetic  simulator  called 
MicroStripes™  [3].  The  design  methodology  was 
to  estimate  dimensions  by  analytical  means 
based  on  measured  line  parameters  and  to 
submit  this  antenna  to  a  manual  optimisation 
regime  based  on  a  battery  of  field  solvers. 
Although  MicroStripes™  can  be  supplied  with 
programs  which  provide  a  means  for  graphical 
entry  it  was  found  to  be  more  efficient  to  use 


Figure  3.1:  Magnetic  probe  measurements  to 
indicate  conditions  for  balanced  resonance 


custom  Visual  Basic™  programs  to  transcribe 
critical  dimensions  into  the  build  language  script 
that  is  interpreted  by  MicroStripe’s  geometry 
builder.  In  this  way  it  was  straightforward  to 
submit  a  targe  number  of  optimisation  trials  to  the 
simulator  very  quickly. 

A  key  advantage  of  a  time  domain  simulator  is 
that  the  designer  does  not  need  to  have  ‘a  priory’ 
knowledge  of  the  resonance  frequency  of  the 
antenna.  Using  Fourier  transformation  certain 
indicators  of  resonance  can  be  extracted  as  an 
outcome  of  the  simulation.  The  method  that  was 
chosen  to  do  this  for  the  case  of  this  topology  can 
be  illustrated  using  a  simplified  bifilar  helix 
primitive  model.  Figure  3.1  shows  how  two 
magnetic  field  measurement  points  are  defined 
dose  to  conductors  which  demonstrate  the 
magnetic  field  magnitude  and  phase  that  is 
associated  with  the  currents  in  those  conductors. 
For  the  loop  mode  the  antenna  should  ideally  be 
perfectly  balanced  and  pass  currents  down  one 
helix  around  the  balun  rim  and  up  the 
diametrically  opposed  helix  to  complete  the  360° 
loop.  Therefore,  at  resonance  the  currents  should 
be  exactly  equal  in  magnitude  and  opposite  in 
phase  at  the  probe  positions  that  are  marked  in 
Figure  3.1.  However  in  terms  of  the  direction 
convention  accorded  to  the  H-field 
measurements  in  Figure  3.1  this  state  of  balance 
is  actually  demonstrated  by  a  cross-over  of  the  H- 
field  with  frequency  loci  when  they  are  both 
plotted  on  a  polar  plot. 

The  corresponding  model  for  the  quadrifilar 
antenna  is  more  complicated  because  the 
antenna  embodies  two  twisted  loops,  [1,2]  from 
which  four  magnetic  vectors,  which  are  indicative 
of  four  currents,  are  obtained.  The  design 


Figure  3.2:  Magnetic  field  measurements 
indicating  balanced  resonance  of  twisted  loop 
primitive 
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methodology  was  to  organise  the  two  loop  pair 
cross-over  points,  each  individually  defined  in  the 
manner  of  Figures  3.1  and  3.2,  into  two 
orthogonal  pairs  which  possess  the  same 
magnitude.  This  objective  was  visualised  as  a 
convenient  optimisation  goal. 


4.  RESOLUTION  PROBLEMS  WITH  THE 
ELECTROMAGNETIC  MODEL 

In  the  course  of  working  on  the  development  of 
the  dialectrically  loaded  quadrifilar  antenna  it  was 
found  that  unlike  the  simulation  of  it’s  component 
twisted  loop  primitives  {4,5],  the  electromagnetic 
modelling  of  the  full  quadrifilar  antenna  is  a 
challenging  task  requiring  a  very  significant 
computing  effort.  In  order  to  attain  a  good  gain 
from  a  small  antenna  (with  small  physical 
volume)  it  is  necessary  to  strive  for  high  Q-factor 
and  conversely  low  operating  bandwidth.  The 
twisted  loop  antenna  that  is  shown  in  Figure  3.1, 
when  formed  on  a  core  of  er=36  material  at  1.8 
GHz  might  typically  possess  a  3dB  fractional 
bandwidth  of  about  0.5%. 


Figure  4.1:  Phase  (<[>)  with  frequency 
responses  of  shunt  connected  half-wave 
transmission  line  of  varying  attenuation 
coefficients,  a.  (a=2x10‘5  Nepers/m) 


Another  important  feature  of  small  antennas  is 
that  they  have  low  radiation  resistance.  For 
example  the  radiation  resistance  of  the  twisted 
loop  antenna  can  accurately  be  estimated  from 
the  point  dipole  formula  because  the  dipole 
length  (cfl)  is  much  less  than  the  free  space  half 
wavelength  of  the  resonant  frequency.  For  this 
class  of  antennas  the  equivalent  dipole  is  formed 
across  the  diameter  of  the  core  between  the 
opposing  centres  of  the  helices.  For  the  case  of 
the  10mm  diameter  twisted  loop  antenna 
operating  at  1.8  GHz  the  radiation  resistance  is 
approximately  2  Ohms. 

The  quadrifilar  helix  antenna  can  be  conveniently 
considered  as  comprising  of  four  shunt  connected 


half-wave  resonators.  Given  that  each  resonator 
is  physically  a  low-loss  helical  conductor  and  that 
the  radiation  resistance  is  also  small  the 
characteristic  phase  shift  with  frequency  through 
the  resonant  condition  occurs  over  a  very  narrow 
frequency  range.  This  principle  can  be  illustrated 
by  plotting  the  phase  with  frequency  responses  of 
half-wave  shunt-connected  transmission  lines 
with  varying  values  of  attenuation  coefficient  (a). 
A  range  of  responses  of  realistic  attenuation 
coefficients  is  plotted  in  Figure  4.1: 

As  can  be  seen  from  the  data  of  Figure  4.1  the 
conditions  which  give  rise  to  an  orthogonal  phase 
difference  between  two  such  shunt  resonators,  of 
slightly  offset  resonance  frequency,  can  only 
occur  over  a  very  narrow  bandwidth  indeed.  The 


Figure  4.2:  MicroStripes™  electromagnetic 
model  of  the  GPS  dielectric  loaded  quadrifilar 
helix  antenna. 


obvious  corollary  of  this  is  that  the  antenna  must 
have  very  critical  dimensions  in  order  to  exploit 
this  mode  of  operation.  In  order  to  predict 
accurately  the  behavior  of  this  structure  using 
electromagnetic  simulation  the  resolution  of  the 
model  has  to  be  very  fine  indeed. 

The  resolution  of  the  model  is  constrained  by 
computing  power  and  certain  simulator 
capabilities.  In  this  research  the  work  was  carried 
out  on  a  battery  of  eleven  Sun  Sparc™ 
processors  and  the  computing  process  was 
refined  to  balance  the  needs  to  obtain  the 
accuracy  available  from  a  fine  enough  resolution 
meshing  and  the  need  to  collect  sufficient 
throughput  of  simulation  results  to  permit  manual 
optimisation. 

An  example  of  an  electromagnetic  model  of  a 
GPS  antenna  is  presented  in  Figure  4.2:  The 
model  of  Figure  4.2  consists  of  approximately 
365,000  cuboid  cells  and  employs  a  graded  mesh 
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and  extensive  use  of  the  MicroStripes™  ‘lumped- 
cell’  utility  to  optimise  computational  efficiency. 
The  TLM  solver  runs  this  model  in  approximately 
30  hours  of  computation  time  using  a  Sun 
Sparc20  machine  with  a  processor  clock  speed  of 
200  MHz.  The  model  has  been  used  to  computer 
optimise  a  GPS  LI  (1575.42  MHz)  circularly 
polarised  antenna  as  is  explained  in  the  next 
section.  However,  this  has  not  proven  to  be 
capable  of  supporting  the  strategy  of  using  the 
helix  current  vectors  as  optimisation  goals  as 
these  have  been  found  to  be  unrealistically 
organised  and  are  pertiaps  indicative  of 
insufficient  model  resolution. 


a 


b 


Figure  4.4:  Predicted  pattern  for  a  crown- 
rimmed  quadrifilar  helix  antenna. 

(a)  elevation 

(b)  orthogonal  cut  elevation 


Use  of  MicroStripes™  was  more  sucessful  as  a 
‘proof  of  principle’  tool  to  demonstrate  the 
feasibility  optimum  cardiod  patterns  with 
favourable  antenna  parameters  such  as  axial 
ratio  and  front-to-back  ratio.  Used  in  this  way,  a 
model  cast  in  a  certain  practical  meshing 
resolution,  was  optimised  for  pattern  shape  at  the 
desired  GPS  LI  frequency.  Figure  4.4  shows  two 
orthogonal  elevation  cuts  that  are  predicted  to 
occur: 


5.  COMPLETE  MODELLING  OF  THE  FEED 
SYSTEM 

Some  improvements  in  the  modelling  of  the  feed 
to  more  closely  match  the  feed  used  in  actual 
measurements  were  implemented.  Firstly,  the 
number  of  cells  used  to  model  the  feed  were 
increased.  This  is  however  limited  by  the 
allowable  minimum  cell  size.  Secondly,  to  better 
predict  antenna  performance,  it  is  necessary  to 
model  the  feed  system  as  physically  close  to  the 
practical  feed  system  as  possible.  Any 
approximation,  further  reduces  the  model’s  ability 
to  predict  performance.  Previous  simulation 
models  implemented  50Q  cable  dimensions, 
while  actual  measurements  used  a  10f2  cable, 
with  a  X/4  transformer  length  to  50Q  connectors. 
Using  10Q  cables  were  more  predictive  of  actual 
measurements,  with  the  predicted  frequency 
closer  to  measured  results.  Overall,  the 
simulation  frequencies  were  about  8-10%  lower 
than  measurements,  as  shown  in  Figure  5.1 . 


Figure  5.1:  Simulation  models  using  50Q — 
and  10Q+X/4  —  transformer  cables 


6.  CONCLUSIONS 

The  development  of  GPS  quadrifiiar  helix 
antennas  using  a  TLM  time  domain  simulator  has 
been  undertaken.  Traditional  meshing  criteria 
concerning  the  appropriate  number  of  cells  per 
guide  wavelength  to  promote  a  desired  degree  of 
predictive  accuracy  break  down  for  the  quadrifilar 
helix  antenna.  This  is  because  the  resolution 
required  to  model  the  operation  of  two  similar 
resonators  which  create  orthogonal  phase 
between  them  has  to  be  fine  enough  to 
accurately  represent  the  very  subtle  difference  in 
physical  length  that  is  required.  Basically  the 
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resolution  that  has  to  present  is  being  multiplied 
by  the  Q-factor  of  the  resonators  that  are  being 
modelled. 

In  any  electromagnetic  optimisation  program  it  is 
vital  to  choose  the  optimisation  criteria  very 
carefully.  The  use  of  probe  points  to  gain  vector 
information  that  is  indicative  of  field  resonance  is 
the  fashion  that  is  described  here  is  only  valid  if 
the  model  is  resolved  sufficiently  to  ensure  that  at 
the  micro  level  close  to  the  probe  point  the  local 
conditions  adequately  represent  the  conditions 
that  are  a  consequence  of  the  resonance  of  the 
global  whole. 

Both  these  conclusions  argue  for  a  significant 
increase  in  the  resolution  of  the  computer  model 
that  is  required  to  model  the  component. 
However  it  is  another  conclusion  of  this  work  that 
a  lower  level  of  resolution  can  provide  good 
insight  into  the  principles  of  operation  of  the 
component  to  a  degree  that  improves  with  model 
reality. 
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Abstract 

Modem  radar  sensors  require  efficient  low  profile  antennas  over  a  reasonable  frequency 
bandwidth,  which  need  to  be  flush  mounted  or  integrated  in  curved  surfaces.  The  applications 
reach  from  spacebome  and  airborne  radar  sensors  to  more  commercial  applications  for  example 
with  antennas  being  integrated  into  cars  for  automatic  cruise  control.  Besides  the  fitting  to 
curved  structures,  mostly  due  to  aerodynamically  reasons,  conformal  antennas  offer  a  variety  of 
beamforming  and  beamsteering  capabilities  which  are  difficult  to  obtain  with  fixed  planar 
antennas.  Cylindrical  phased  array  antennas,  for  example,  offer  the  potential  either  to  cover 
360°  in  azimuth  by  a  omnidirectional  pattern  or  to  create  a  sharp  beam  in  an  arbitrary  azimuth 
direction  using  the  same  array  antenna.  This  option  is  particularly  useful  for  communication 
applications  needing  an  omnidirectional  pattern  in  the  acquisition  phase  and  a  high  gain  antenna 
during  the  communication  link  to  enlarge  the  signal-to-noise  ratio. 

This  contribution  presents  aperture  coupled  patch  antennas  on  a  cylindrical  surface  having  more 
than  20%  bandwidth  (-10  dB)  at  S-  and  C-band  for  both  radar  and  communication 
applications.  Besides  the  modelling  measured  data  for  the  elements,  the  subarray  used  and  the 
array  is  reported.  Directional  patterns  for  the  circular  patch  array  are  synthesized  and  measured 
using  a  beamforming  network. 

Introduction 

New  radar  systems  require  efficient,  light  weight,  low  profile  antennas  over  a  frequency 
bandwidth  of  10%  -  20%  to  embed  the  antenna  in  the  surface.  There  are  different  reasons 
requiring  conformal  antennas.  In  airborne  applications  the  fitting  to  an  aerodynamic  body 
minimises  drag,  in  commercial  applications  like  automatic  cruise  control  for  cars  and  distance 
monitoring  for  parking  a  conformal  antenna  can  be  hidden  to  allow  aesthetically  looking 
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products  with  full  electrical  functionality. 

Conformal  antennas  are  a  solution  to  the 
geometrical  problem. 

Besides  the  capability  for  integration  in  curved 
surfaces  conformal  antennas  offer  more 
flexible  radiation  characteristics  not  known 
from  mechanically  fixed  planar  antennas. 

The  potential  for  both  omnidirectional 
coverage  and  narrow  beams  at  arbitrary 
azimuth  angles  is  particular  interesting  for  the 
next  generation  communication  and  network 
systems  where  space  division  multiple  access 
(SDMA)  techniques  are  required.  The  finite 
spectrum  of  frequencies  will  be  the  major 
limiting  factor  and  SDMA  systems  are  one 
way  to  overcome  this  problem.  By  sharp 
beam  forming  it  is  possible  to  use  the  same 
frequency  simultaneously  in  different  Fis' I:  3D-riew  of  an  apemre  coupled  patch  on  a 
directions.  Additionally  either  the  power  cylindrical  surface 

consumption  for  both  base  stations  and 

handsets  can  be  reduced  during  the  communication  link  or  the  coverage  area  of  the  base  station 
can  be  increased. 

A  key  point  in  these  systems  is  the  base  station  antenna,  which  has  to  accept  new  users  just 
applying  for  a  link  as  well  as  to  maintain  communication  or  data  links  with  high  data  rates.  Base 
station  antennas  therefore  need  omnidirectional  characteristics  as  well  as  sharp  beams  providing 
the  users  with  a  sufficient  signal-to-noise  ratio.  These  requirements  can  best  be  met  by  a 
cylindrical  structure. 

This  contribution  presents  the  analysis  of  the  structure  shown  in  fig.  1.  For  mounting  purposes 
a  cylinder  with  a  metallic  surface  is  placed  in  the  center.  The  metallized  surface  simplifies  the 
modelling  but  the  presented  model  is  not  restricted  to  these  surfaces.  This  cylinder  is  covered  by 
3  layers:  A  foam  layer,  e.g.  ROHACELL,  to  keep  the  distance  between  substrate  and  metallic 
cylinder,  a  substrate  layer,  e.g.  RT/duroid,  for  the  feed  line  on  the  inner  surface  and  the 
aperture  on  the  outer  surface.  An  outer  foam  layer  carrying  the  patch  completes  the  aperture 
coupled  patch  structure.  The  feeding  network  as  well  as  the  coupling  slots  may  be  etched  on  a 
planar  substrate,  which  is  bent  afterwards  around  the  cylinder,  eventually  by  heating  the 
substrate  material  to  moderate  temperatures.  The  patch  is  etched  on  a  thin  foil  and  placed  on  the 
outermost  foam  layer. 

Modelling 

The  aperture  is  replaced  by  equivalent  magnetic  currents  Ms,  which  allows  the  splitting  of  the 
structure  in  2  parts.  The  microstrip  line  and  the  patch  are  replaced  by  a  two  dimensional  electric 
current  density  J.  By  enforcing  vanishing  tangential  electric  fields  on  the  patch  and  the 
microstrip  line  and  the  continuity  of  the  magnetic  field  in  the  aperture  a  set  of  integral  equations 
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(1)  -  (3)  with  unknown  currents  on  the  patch,  on  the  microstrip  and  in  the  aperture 
can  be  set  up.  For  the  equations  a  conventional  cylindrical  coordinate  system  with  the  z 
direction  along  the  axis  of  the  cylinder  is  used.  Besides  these  currents  the  Green's  functions 
QikFQ  occur  in  the  equations,  where  k  represents  the  number  of  the  plane  containing  the  source 
Q  and  i  represents  the  plane  where  the  field  F  is  calculated,  respectively. 

jjG”EJ'jpd(i>dz  +  JJ  G32EmS°  M^dtpdz  =  0  (1) 

Ap  As 


JJ  G2W,‘JLdipdz  =  JJ  -  JJ  G22flM*  Msd<l>dz  (2) 

Al  Af  As 

JJ  GnEjL  JLd<t>dz + JJ  GnEM*  MSid<j)dz  =  0  (3) 

Al  As 

The  accurate  calculation  of  the  Green’s  functions  is  done  using  a  transfer  matrix  method  as 
described  in  [1].  By  using  this  method  even  for  large  values  of  the  radius  the  functions  can  be 
calculated  numerically  stable. 

In  order  to  save  calculation  time  the  solution  of  (3)  is  omitted.  This  can  be  done  by  introducing 
a  reflection  coefficient  on  the  feeding  microstrip  line  assuming  a  quasi  TEM  wave  propagating 
along  the  line  in  z  direction.  This  technique  is  well  known  for  planar  structures  [2]  and  adapted 
here&for  the  cylindrical  case.  The  reflection  coefficient  on  the  line  for  the  cylindrical  case  (4) 
looks  similar  to  the  one  in  the  planar  case  [2]  and  includes  the  tangential  magnetic  field  at  the 
slot  due  to  the  incident  magnetic  field  on  the  microstrip  line  h<j)  and  the  magnitude  of  the 
magnetic  source  in  the  slot  M<j>.  The  magnetic  field  h^  in  turn  can  be  calculated  using  (5)  where 
Zl  denotes  the  impedance  of  the  microstrip  line,  G  the  Greens  function  in  the  spectral  domain 
and  Jj  the  Fourier  transforms  of  the  feedline  current's  form  function  in  ^-direction. 

fl  =  |jjM0  h^(<t>,  r  =  rs)d<pdz  (4) 


h{<P’r  =  rs)=  2nl[z[  (K  ~  ~kzn)Jf  (m)e 


(5) 


Using  the  reflection  coefficient  (4)  and  the  magnetic  field  of  the  feedline  mode  (5)  the  resulting 
tangential  magnetic  field  at  the  inner  border  of  the  aperture  H^s  due  to  the  electric  current  on  the 
feedline  JL  can  be  expressed  without  explicitly  mentioning  the  current  on  the  feedline. 

With  this  excitation  model  equations  (2)  and  (3)  can  be  replaced  by  (6)  resulting  in  a  set  of  2 
integral  equations  which  need  to  be  solved  simultaneously  for  the  unknown  current  densities. 
This  omits  the  calculation  of  the  exact  two  dimensional  current  distribution  on  the  feedline. 


=-jjG22HMSMsd(f>dz  +  \\G23HjfJpd(pdz 

As  AP 


(6) 
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Equations  (1)  and  (6)  are  solved  using  a  methods  of  moments 
technique  with  a  Galerkin's  scheme.  Entire  domain  basis 
functions  are  used  to  model  the  current  distribution  because  it  is 
known  from  the  analysis  of  planar  aperture  coupled  patches  that 
the  current  distribution  on  these  structures  can  be  modelled  very 
accurately  by  these  functions.  In  addition  this  approach  needs  little 
computation  time  as  the  number  of  unknown  coefficients  can  be 
kept  low. 


Fig.  2:  Front  view  of  the  S- 
band  element 


Results  2  2,2  2,4  2.6  2,8  3 

Frequertcy/GHz 

Two  different  antennas  are  designed  _.  ,  .  _  .  _  .  _  .  _ ,  , 

Fig.  3  Measured  tnput  reflection  coefficient  of  the  S-band 

in  order  to  look  at  different  values 

_  ,  m  ,  ,  element 

for  kor.  The  S-band  antenna,  o 

designed  for  2.45  GHz,  consists  of  330  _  30 

a  metal  cylinder  (15  mm  diameter,  see  m  0  j 

fig.  1)  covered  by  22.5  mm  foam.  For  the  ®  '10  '  60 

feedline  substrate  a  0.5  mm  RT/duroid  was  |  -20  :  nAX 

chosen,  covered  by  10  mm  foam  as  substrate  J*  ,30  /  fl/7^  -A  \  \ 

for  the  patch.  The  C-band  antenna  for  /  u  (  ’■  y  1 

-40  I - TC •  (  - )■'  .*■«  - 4 H  90 

5.2  GHz  is  made  of  a  metal  cylinder  |  J 

( 35  mm  in  diameter)  covered  by  14  mm  \  / 

foam,  0.5  mm  RT/duroid  and  4  mm  foam  - co-poi ©2.4 ghz /jf 

for  the  patch.  —  .  2S  @  GHz  - /  dea 

. X"Pol  @  2.5  GHz  ?  9 

S-band  element  — co-poi@2.sghz  =====i==*==s^ 

. X-Pol  @2.6  GHz 

The  S-band  single  element  is  displayed  in  180 

fig.  2  showing  a  pen  for  comparison.  The  Fig.  4  Measured  pattern  in  the  azimuth  plane  of 

measured  input  reflection  coefficient  for  this  the  S-band  element 

element  is  plotted  in  fig.  3  showing  a 

-10  dB  bandwidth  of  20.8%  (2.32  GHz  -  2.86  GHz)  indicated  by  light  grey  and  a  -14  dB 
bandwidth  of  12.6%  (2.38  GHz  -  2.70  GHz)  indicated  by  dark  grey.  The  pattern  measured  in 
the  azimuth  plane  is  displayed  in  fig.  4.  A  half  power  beamwidth  of  83°  and  a  maximum 
crosspolarization  level  of  -19.5  dB  is  observed. 
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5.4  GHz  while  the  maximum  of  the 
crosspolarization  does  not  change  much  with 
frequency  and  keeps  below  -30.5  dB  for  all 
measured  frequencies.  The  measured  gain  of 
the  1x2  subarray  is  plotted  in  fig.  8  showing 
a  minimum  gain  of  8.6  dBi  within  the 
-10  dB  bandwidth  and  9.5  dBi  within  the 
-14  dB  bandwidth. 

C-batid  array 

Using  8  subarrays  a  8x2  anray  having  a 
period  of  45°  is  fabricated  and  measured 
using  various  excitations.  Different 
beampointings  for  the  array  with  respect  to 
the  element  position  are  calculated,  one 
between  two  elements,  one  in  front  of  an 
element  and  one  in  between.  Fig.  9  shows 


the  patterns  for  this  3  beampositions  using  8  excited  elements.  Neither  the  sidelobe  level  nor  the 
half  power  beamwidth  nor  the  calculated  gain  of  13.5  dBi  gain  shows  a  significant  change  with 
beam  position.  For  the  beam  pointing  towards  0°  with  respect  to  element  no.  4  the  measured 
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result  is  shown  in  fig.  10.  The 
measured  half  power  beamwidth 
varies  with  frequency  from  38°  to  39°, 
the  sidelobelevel  is  below  -12  dB  and 
the  crosspolarization  is  better  than 
-22  dB. 

Conclusion 

An  analysis  for  aperture  coupled 
patches  on  cylindrical  surfaces  using 
entire  domain  basis  functions  was 
presented.  Broadband  elements  and 

Fig.  8 


o 


4,5  5  5,5  6 


Frequency/GHz 

Measured  gain  of  the  C-band  1x2  subarray 


Fig.  9  Calculated  scanning  beam  of  the  8x2  Fig.  10  Measured  beam  of  the  8x2  array  with  a 
array,  beamposition  with  respect  to  beamposition  Of  0°  with  respect  to  element 

element  no.  1  no.  4 


arrays  were  fabricated  in  S-  and  C-band,  respectively.  The  measurements  show  a  bandwidth  of 
more  than  20%  for  both  antennas.  The  measured  results  show  the  capability  of  the  array  to  form 
a  directive  sector  beam  as  well  as  an  omnidirectional  pattern  necessary  for  SDMA  base  stations. 
As  no  gain  variation  occurs  while  scanning  the  direction  the  array  is  also  recommended  for  360° 
scanning  radar  sensors. 
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I.  Introduction 

Absorbing  Boundary  Conditions  (ABCs)  tire  essential  elements  for  solving  open 
region  radiation  or  scattering  problems  because  they  allow  limiting  the  computational 
domain  to  a  finite  size.  Several  ABCs  were  developed  for  outer  boundaries  that  form 
canonical  shapes.  Most  practical  radiation  or  scattering  problems  have  geometrical 
shapes  that  do  not  conform  to  a  box,  circle  or  sphere.  Hence,  when  using  any  of 
these  shapes  as  outer  boundaries  for  mesh  termination,  the  white  space  around  the 
scatterer  might  be  unnecessarily  large  resulting  in  costly  simulation  in  terms  of  both 
memory  and  run  time. 

To  address  this  problem,  ABCs  were  developed  that  can  be  applied  to  boundaries 
that  conform,  as  close  as  practicable,  to  the  radiating  geometry.  Several  attempts 
were  made  to  apply  Bayliss-Turkel  (BT)  ABCs  to  non-circular  outer  boundaries  (see 
[1]  as  a  representative  example  of  such  efforts).  In  these  works,  the  BT  operators  were 
projected  onto  a  non-circular  boundary  while  employing  different  approximations  for 
mixed  partial  derivatives.  Another  class  of  flexible  mode  —  annihilating  ABCs  were 
developed  that  were  also  applied  to  non-circular  outer  boundaries  [2]  (see  also  [3]  and 
references  therein).  In  all  of  these  previous  ABC  constructions,  the  outer  boundary 
had  to  be  positioned  few  wavelengths  from  the  nearest  surface  of  the  structure  to 
obtain  practical  levels  of  accuracy  [3]. 

In  a  total  departure  from  the  philosophy  employed  earlier,  Kreigsman  et  al.  [4] 
applied  the  BT  operator  directly  on  the  surface  of  the  scatterer,  while  assuming  that 
the  origin  of  waves  is  at  the  center  of  the  osculating  circle  at  each  outer  boundary 
node.  The  result  of  the  application  of  Kreigsman  et  al.  was  not  very  satisfactory 
because  it  was  only  possible  to  apply  second-order  BT  operators  (higher-order  oper¬ 
ators  employ  radial  derivatives  and  thus  cannot  be  determined  apriori  on  the  surface 
of  the  structure).  However  the  novelty  of  using  local  scattering  centers  in  the  work 
of  Kreigsman  et  al.  lead  others  to  extend  the  concept  of  the  local  scattering  center 
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to  outer  boundaries  that  are  positioned  at  a  distance  from  the  structure’s  surface  [31 

[5], [6].  1  J’ 

The  comparative  study  of  Lichtenberg  et  al.  [3]  showed  that  enforcing  BT  opera¬ 
tors  with  local  scattering  centers  outperforms  other  ABCs  that  use  a  single  point  of 
origin  for  all  boundary  points.  We  will  refer  to  the  Nth  order  BT  operator  applied 
with  a  local  scattering  center  as  BT(^cal .  However,  despite  its  superior  performance 
in  comparison  to  other  operators,  the  BT '^)eal  does  not  give  practical  accuracy  when 
enforced  close  to  the  scatterer.  In  [3],  for  instance,  good  accuracy  level  was  possible 
only  when  the  outer  boundary  was  pushed  two  wavelengths  away  from  the  structure. 
Third-  or  higher-order  BT  operators  were  not  implemented  for  application  on  cir¬ 
cular  or  non-circular  outer  boundaries  because  of  the  complexity  of  their  previous 
formulations. 

In  a  recent  work,  BT  operators  were  implemented  in  an  exact  fashion  resulting  in 
an  appreciably  enhanced  accuracy  [7].  The  exactness  of  this  recent  implementation 
and  the  relative  superiority  of  BT?0?al  in  comparison  to  previous  techniques  makes  it 
only  logical  to  extend  the  application  of  higher-order  BT  operators  to  non-circular 
outer  boundaries  using  the  concept  of  local  scattering  centers.  This  paper  develops 
these  operators  which  are  implemented  without  any  approximation  other  than  the 
discretization  needed  to  transform  derivatives. 

Other  mesh-truncation  techniques  that  were  developed  for  non-circular  outer  bound¬ 
aries  in  frequency  domain  include  the  class  of  material-based  terminations.  This 
includes  impedance  boundary  conditions  and  perfectly  matched  layers  [8]- [10].  How¬ 
ever,  these  techniques  witnessed  analytical  formulation  only,  and  no  numerical  results 
have  been  made  available  to  test  their  effectiveness. 

II.  ABCS  FOR  CONVEX  BOUNDARIES 

The  development  here  will  be  demonstrated  by  solving  the  problem  of  plane  wave 
scattering  from  a  perfect  electric  conductor  (p.e.c.)  in  two-dimensional  space.  The 
method  of  solution  employed  here  will  be  the  Finite  Elements  Method  (FEM),  however 
the  discussion  applies  equally  to  the  Finite  Difference  Method. 

The  governing  Helmholtz  equation  is  given  by 


V2u  +  k2u  =  0  (1) 

where  k  is  the  wave  number.  Equation  (1)  is  subject  to  Direchlet  or  Neuman  (or 
both)  boundary  conditions  on  the  surface  of  the  scattering  object. 

Consider  the  scattering  object  shown  in  Fig.  1.  Using  FEM,  we  mesh  the  region 
bounded  by  the  p.e.c.  scatterer  and  the  object- conformable  outer  boundary,  as  shown 
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in  Fig.  1.  The  Nth  order  BT  operator  specified  at  a  circular  boundary  with  origin  at 
p  =  0,  is  given  by 


Bn 


2 i  -  3/2 
P 


(2) 


We  enforce  (2)  on  the  outer  boundary  at  each  node  using  the  osculating  circle 
approximation.  This  requires  determining  the  curvature  of  the  osculating  circle  at 
each  node  (the  node  and  two  adjacent  ones  are  sufficient  to  determine  the  curvature). 
Effectively,  we  can  represent  the  ABC  at  each  node  as 


<..  =  n(^  +  ^(2i-3/2)+ii)  (3) 

Knode  represents  the  curvature  of  the  osculating  circle  at  outer  boundary  nodes.  For 
the  special  case  of  zero  curvature,  as  in  the  case  of  planar  terminal  boundaries,  (3) 
reduces  to 


(4) 

When  the  outer  boundary  is  planar  and  coinciding  with  the  Cartesian  planes,  (4) 
reduces  to  Higdon’s  boundary  condition  when  it  is  applied  in  the  frequency  domain 
(after  the  transformation  of  dt  to  jk)  [11]. 

III.  Numerical  Experiment 

To  demonstrate  the  effectiveness  of  the  proposed  technique,  we  consider  the  problem 
of  scattering  by  a  1.95A  X  1.95 A  perfectly  conducting  square  cylinder.  The  most 
suitable  outer  boundary  for  this  geometry  is  a  square  as  shown  in  Fig.  2.  The  outer 
boundary  is  positioned  such  that  the  separation  between  it  and  the  conductor  is  0.35A. 
The  spacing  between  node  layers  is  as  before. 

Figures  3(a)  and  3(b)  show  the  magnitude  and  phase  of  the  electric  field  on  the 
observation  contour  T  (see  Fig.  2)  as  calculated  using  the  FEM  solution.  A  total  of 
164  nodes  span  the  observation  contour.  The  numbering  of  the  nodes  starts  at  the 
lower  left-hand  corner  and  proceeds  clockwise.  So  node  number  20  corresponds  to  the 
middle  point  on  the  left-hand  side,  and  node  number  105  corresponds  to  the  middle 
point  on  the  righ-hand  side.  Results  are  only  shown  for  field  values  on  the  upper  half  of 
the  contour  due  to  the  symmetry  of  the  problem.  For  comparison,  the  MoM  solution 
is  also  provided  for  this  problem.  The  FEM  solutions  were  obtained  using  BTiocal 
and  BT*ocal.  The  agreement  between  BT*ocai  and  MoM  solutions  is  very  satisfactory, 
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especially  in  the  observation  region  closest  to  the  corner  of  the  conducting  box.  In 
fact,  it  is  observed  from  Fig.  3(a)  that  the  maximum  error  in  the  field  magnitude  is 
kept  below  1.2%  over  the  en  tire  observation  contour  when  BT{ ^cal  is  used. 


IV.  Conclusion 

This  paper  presented  the  development  of  a  new  class  of  ABCs  that  can  be  ap¬ 
plied  on  non-circular  convex  mesh-termination  boundaries.  The  new  ABCs  are  based 
on  the  exact  application  of  BT  operators  using  local  scattering  centers.  This  work 
demonstrated  the  application  to  scattering  problems  in  two-dimensional  space  and  in 
the  frequency  domain.  However,  these  ABCs  and  their  implementation  are  directly 
applicable  to  scattering  problems  in  three  dimensional  space  and  to  time-domain 
problems.  These  extensions  will  be  discussed  in  the  presentation. 
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Fig.  1.  Diagram  illustrating  an  object-conformable  mesh 
termination  boundary 


Fig.  2.  Computational  domain  used  for  the  problem  of  scattering 
by  a  perfectly  conducting  square  cylinder. 


494 


CYLINDRICAL  AND  CONICAL  FSS 


J  C  Vardaxoglou’,  G  Loukos,  M  Jayawardene,  R  S eager.  J  Eade 

'Centre  for  Mobile  Communications  Research 
Department  of  Electronic  and  Electrical  Engineering 
Loughborough  University 
Loughborough  Leicestershire  LEI  1  3TU 


Abstract 

The  properties  of  resonant  frequency  selective  guides  and  horn  antennas  are  reviewed.  The  importance 
of  the  array’s  resonance  is  emphasised  and  is  an  essential  factor  in  the  performance.  The  computational 
procedure  for  the  primary  propagation  and  radiation  characteristics  of  these  structures  is  discussed. 


Introduction 

This  paper  reviews  the  fundamental  properties  of  resonant  frequency  selective  guides  (FSGs),  [1].  These 
are  constructed  from  periodic  arrays  that  are  frequency  selective,  and  can  be  viewed  as  guiding 
structures  at  resonance  and  ‘open’  (leaky)  elsewhere.  Computer  models  have  been  developed  to  predict 
the  propagation  and  mode  composition  of  FSGs  acting  as  a  waveguides  with  a  circular  cross  section. 
Since  these  are  primarily  open  structures,  the  determination  of  the  complex  propagation  constant  was 
examined.  This  information  was  necessary  in  order  to  ascertain  the  field  distributions,  which  in  turn 
affect  their  radiation  properties.  The  latter  may  be  of  the  form  of  leaky  or  end-fire  due  to  guided  modes. 
The  structures  employ  doubly  periodic  elements  of  transverse  dipoles  and  square  loops  shown  in  Fig.  1 
and  Fig.  2  respectively.  The  radiation  properties  of  the  waveguides  formed  into  conical  antennas  are 
discused.  As  well  as  the  information  given  from  the  propagation  characteristics  a  computer  model  has 
been  developed  to  predict  the  radiation  a  finite  model  of  a  conical  structure  of  dielectrically  loaded 
dipole  elements.  In  the  computations  a  finite  array  analysis  has  been  used  and  the  boresight  frequency 
response  and  radiation  patterns  can  be  predicted.  The  analysis  considers  a  finite  array  of  conducting 
elements  coated  with  a  thin  arbitrary  material.  This  may  be  of  dielectric  or  ferrite  material. 


Propagation  characteristics 

The  computer  model  produces  the  complex  propagation  constant  and  plots  the  aperture  fields  and  field 
lines  of  the  modes.  The  analysis  is  based  upon  the  solution  of  the  wave  equation  (in  cylindrical  co¬ 
ordinates)  in  conjunction  with  Floquet’s  theorem  [2],  The  fields  have  been  set  up  as  combinations  of 
Bessel  and  Hankel  functions.  The  Method  of  Moments  (MoM)  with  rooftop  look  alike  basis  and  testing 
functions  has  been  used  to  determine  the  propagation  (p)  and  attenuation  (a)  constants  of  a  homogenous 
system.  Once  these  were  found  the  elements’  currents  and  fields  on  the  unit  cell  of  the  array  were 
calculated  by  matrix  inversion.  Because  of  the  large  orders  and  arguments  involved,  necessary  steps 
have  been  taken  in  the  calculation  of  these  functions.  The  propagation  characteristics  are  obtained  by  a 
method,  which  chooses  the  appropriate  Riemann  surfaces.  The  latter  is  a  crucial  factor  for  bringing  out 
surface,  guided  and  leaky  waves. 
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For  the  dipoles  transverse  to  the  length  of  the  structure,  it  was  found  that  the  mode  with  the  lower  cut-off 
was  a  hybrid  EH.  HE  types  were  also  supported,  but  were  highly  attenuated.  We  have  found  distinct 
differences  of  the  modes  at  different  frequencies.  Fig.  3  shows  the  real  P  and  imaginary  a  propagation 
constants.  The  leakage  is  evident  from  the  attenuation  constant  plot.  It  can  be  seen  that  firstly  a  surface 
wave  appears,  followed  by  a  leaky  (complex)  wave.  The  monitoring  of  the  attenuation  constant  and 
guiding  modes  supported  at  different  frequencies  have  been  extracted  from  the  dispersion  diagram  by 
solving  a  homogeneous  matrix  system.  The  resonance  is  identified  by  observing  the  region  of  minimal 
leakage.  The  minima  of  the  attenuation  therefore  correspond  to  guided  waves.  Square  loop  arrays  (Fig. 
4)  produced  minima  with  small  attenuation  constant.  There  is  a  correspondence  between  the  transition 
regions  (before  and  after  the  resonance)  and  the  leak}'  behaviour  of  the  structure.  The  effects  of  the 
periodicity  and  element  dimensions  have  shown  to  affect  the  frequency  and  level  of  the  leakage. 

Linear  dipoles  exhibit  weak  guidance,  but  square  loops  have  clearly  identified  guidance  with  small 
insertion  loss.  The  dominant  factor  here  is  the  resonance  of  the  array,  opening  new  avenues  in  leaky 
wave  antennas.  A  study  has  been  done  on  the  latter  in  a  related  project.  Unlike  the  traditional  leaky 
wave  antennas,  Frequency  Selective  Structures  have  close  element  spacing,  so  there  is  little  interference 
from  grating  lobes  and  scan  blindness. 


Radiation  patterns 

Fig.  5  shows  a  drawing  of  the  frequency  selective  horn  antenna  as  well  as  the  geometry  of  the  dipole  and 
its  coat.  The  horn  is  fed  with  a  cylindrical  waveguide  carrying  the  dominant  EHn  hybrid  mode, 
available  from  the  propagation  results  discussed  above.  The  conventional  TEn  of  a  closed  waveguide 
has  also  been  used.  This  in  turn  has  been  modelled  using  a  superposition  of  point  sources  at  each 
element  (segment)  position.  The  Electric  Field  Integral  Equation  (EFIE)  with  an  arbitrary  coating,  of 
relative  permeability  and  permitivity  sr  and  \ir  respectively,  takes  the  following  form: 
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/  is  the  current  flowing  on  the  dipole  of  length  L,  G  is  the  Green's  function  and  E?  is  the  incident  electric 
field.  ra  and  denote  vectors  regarding  the  inner  and  outer  radii  of  the  cylindrical  layer  of  the  coating. 
The  EFIE  is  transformed  into  a  matrix  system  having  segmented  the  elements.  This  system  has  been 
solved  by  a  conjugate  gradient  iterative  method  and  an  elimination  technique. 

Varying  the  dielectric  thickness  has  shifted  the  in  band  (resonance)  of  the  antenna  as  well  as  broaden 
the  bandwidth.  This  is  depicted  in  Fig.  6,  where  it  can  be  observed  that  the  gain  is  also  affected.  The 
source  and  elements  have  been  modelled  in  a  finite  manner  by  rendering  the  feeding  aperture  discrete. 
This  approach  ensures  that  the  excitation  is  properly  taken  into  account  and  will  aid  in  matching  the 
horn.  The  method  also  uses  the  MoM,  but  being  finite  large  matrices  are  involved.  The  size  of  the 
matrix  is  typically  2,000  by  2,000  elements  for  a  10  cm  long  hom.  The  model  predicted  well  the 
radiation  performance  on  boresight,  reproducing  the  high  gain  at  the  resonant  frequency.  Broadly  the 
radiation  patterns  were  reproduced,  but  there  are  some  discrepancies  of  the  sidelobe  location  and  levels. 
This  point  requires  further  investigation.  The  radiated  fields  were  calculated  for  the  relevant  mode.  In 
addition,  low  crosspolarisation  was  achievable  using  tightly  packed  symmetrical  element,  such  square 
loops  and  rings.  Far  field  patterns  of  structures  with  dipoles,  crossed  dipole  and  loop-type  elements  with 
various  orientations  have  been  compared.  Fig.  7  shows  the  boresight  gain  relative  to  the  open-ended 
waveguide  of  an  FSG  with  the  same  element  gometry.  Although  the  band  centre  is  near  15  GHz,  the 
return  loss  is  best  at  slightly  below  that  frequency.  The  E-plane  radiation  pattern  at  that  frequency  is 
shown  in  Fig.  8. 

The  element  geometry  has  a  noticeable  influence  on  the  gain  and  bandwidth  of  the  FSG.  Whereas  dipole 
elements  have  narrow  bandwidths,  circular  elements  and  square  loops  produce  wider  bandwidths 
associated  with  a  gain  increase.  The  bandwidth  increase  is  related  to  the  sensitivity  of  the  internal 
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reflections  to  the  incident  angles.  Horns  with  cross  dipoles  produce  two  bands  whose  separation  depends 
on  the  dimension  of  their  individual  arms.  The  lower  wave  band  is  dominated  by  a  surface  wave. 

Closing  comments 

Cylindrical  resonant  arrays  of  transverse  dipoles  and  square  loops  have  been  introduced  and  studied. 
These  structures  exhibit  attractive  characteristics  regarding  their  mode  and  radiation  behaviour.  They 
are  also  easy  to  construct  and  lightweight.  The  propagating  modes  are  hybrid  with  symmetries  in  certain 
frequencies'  Studies  so  far  have  shown  that  it  is  feasible  to  design  a  horn  antenna  to  operate  at  two 
separate  frequencies  having  similar  beam  patterns.  The  fruits  of  this  effort  would  lead  to  producing 
multifunctional  tasks  with  beam  shaping  and  frequency  scanning  that  ultimately,  would  greatly  enhance 
the  performance  of  antennas  in  mobile  and  wireless  network  communication  systems. 
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Abstract 

A  new  code  has  been  developed  which  calculates  the  response  of  an  infinite  planar 
frequency  selective  structure  with  an  arbitrary  number  of  coupled  layers  embedded  in 
isotropic  stratified  media.  All  layers  are  supposed  to  have  the  same  periodicity  but  they  can 
have  individual  element  shapes  and  displacements.  The  element  shape  of  each  layer  is 
approximated  by  a  polygon  with  an  arbitrary  number  of  comer  points.  The  media  in  which 
the  structure  is  embedded  are  defined  by  complex  permittivity  and  complex  permeability 
scalars.  The  geometry  of  the  problem  is  defined  with  a  graphical  user  interface,  which 
allows  selection  between  defined  element  types  including  dipoles,  tripoles,  crosses, 
jerusalem  crosses  and  anchors.  Each  element  type  can  be  defined  either  as  apertures  or  as 
conductors.  All  types  can  in  addition  be  loaded  and  the  corners  of  the  polygon  can  be 
smoothed.  The  code  is  based  on  Floquet  mode  expansion  and  convolution  with  the  Fourier 
transform  of  the  shape  function  of  the  element. 


1  INTRODUCTION 

Commercially  available  codes  are  generally  limited  either  to  a  single  simple  element  shape 
or  to  be  defined  on  a  grid  with  rectangular  periodicity  and  poor  resolution.  The  simply 
shaped  elements  show  large  sensitivity  on  incidence  angles  and  polarisation  and  a 
rectangular  grid  has  grating  lobes  at  lower  frequencies  than  a  hexagonal  grid  with 
comparable  element  size.  To  remedy  those  limitation  we  have  developed  a  new  code  which 
accepts  general  periodicity  and  arbitraiy  shaped  polygons.  An  extensive  literature  search 
showed  that  the  CG-FFT  method  [1],[2]  was  a  promising  candidate  to  fulfil  our 
requirements.  However  previously  published  results  was  on  rectangular  grids  and 
rectilinear  element  shapes.  Our  decision  was  therefore  to  modify  the  method  for  more 
general  geometries. 
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2  GENERAL  DESCRIPTION  OF  THE  PROBLEM 

We  would  like  to  calculate  the  transmitted  and  reflected  fields  when  we  are  incident  with 
a  plane  wave  with  given  frequency,  polarisation  and  incidence  angles  on  an  FSS  structure 
as  shown  in  Figure  1. 


z0  z\  Z2  Zt  Zt+l  ZL-1ZL 


Figure  1.  Single  FSS-layer  surrounded  by  dielectrics. 

The  FSS-layer  is  either  a  screen  with  perfectly  conducting  elements  or  aperture  elements 
in  a  perfectly  conducting  screen.  The  elements  are  arranged  in  a  doubly  periodic  pattern 

as  shown  in  Figure  2.  The  periodicity  is  defined  by  the  vectors  d\  and  d2  . 


Figure  2.  Periodic  pattern  with  grating  vectors 

3  THE  INCIDENT  FIELD 

We  divide  the  electromagnetic  field  into  an  incident  field  and  a  field  scattered  by  the 
periodic  pattern. 
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The  incident  field  which  tangential  components  are  continuous  is  given  by 


**  =  v*].-**”** 

=  [(a*  v +. - a*  + (a*  V + a* 't ♦. a* 

in  the  interval  zz_  j  <  z  <  Z/  where  z_j  =  -«> ,  z0  =  0 ,  _  ,  dk  and  zL  +  j  =  00 . 

The  wave  number  in  layer  number  /  is  given  by 

*'  =  ^e,n,lo-(4)2-(<)2 


Due  to  Maxwell’s  equations  the  longitudinal  E-field  is  related  to  the  transverse  E-field  by 

4,+  =  -(k‘f/k[)^U  fif  =  (3) 

and  the  H-field  is  also  related  to  the  transverse  E-field  by 


EJ*i  1+  _ 

HP  ~ 

V 

Kz  v 

Mo  0  ♦ 

< 

II 

l+  _ 

Zl\  pi,  1+ 
~Tyoep 
kz 

< 

II 

HU+  = 

A 

—  FoFa/+ 

h‘;'- 

=  _iL 

z 

H/* 0  0  * 

hence  the  whole  field  can  be  described  by  only  two  field  components. 


4  SCATTERED  FIELD  AND  INDUCED  CURRENTS 

The  FSS-layer  introduces  a  scattered  field  induced  by  the  currents  in  the  layer.  These 
currents  are  confined  to  the  conducting  parts  of  the  FSS.  Assuming  zero  thickness  and 
perfect  conductivity  the  tangential  components  of  the  E  field  are  continuous. 
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In  general  we  can  write  the  scattered  field  as  a  collection  of  Floquet  modes 


EL  = 


p<q 


+  (Ep'  q’l-pp’ q  +  JEf  q;l-$p'  q)eK*z]e~M'x  +  O) 


(5) 


where  kp<  q  =  k'x  +  pkx^  x  +  qk2t  x  and  kp' q  =  k‘y  +  pkx<  y  +  qk2i  y  -  The  notation  k^  x  denotes  the 
x  component  of  the  vector  k\  in  the  pair  of  vectors  defined  by 
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that  spans  the  reciprocal  grating. 

Similarly  the  scattered  H  field  can  be  written  as 


HL  =  £  r  <  it *  V «  -  t;  >•  ,;,y ■ 

-  (7j  p’  q'lE\  q;l'pp’ q  -  Ycp  p’  qJEp  q',l~$P’  q)e^’qJz]e-j^x  +  *?*?> 
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This  field  is  also  continuous  except  for  the  interface  number  t  where  the  FSS-layer  is 
present.  The  current  in  this  interface  is 


where  the  constants  can  be  determined  with  an  iterative  technique. 
Converting  to  Cartesian  coordinates  gives 
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with  kx  q  =  kp’  q/kPp  q  and  kP’ q 


=  t:q/k 


p,q 
p  ' 


5  INTEGRAL  EQUATION 


By  introducing  the  variables  s,  t  according  to  xx  +  yy  =  sd\  +  tdj  we  can  write  the 
scattered  field  in  the  FSS-layer  as 


-j(&  +  k'y ) 


(ID 


where  the  sums  can  be  identified  by  the  Fourier  transforms  of  the  double  discrete  series 
E?xp,q)  and  Es*p,q)  which  render  the  continuous  and  periodic  functions  Ex(s,  t)  and 

t ) . 


It  is  trivial  to  verify  that 


&**• q)  =  f  f  &s,  V2”"”  +  qndsdt  (12) 

and  a  similar  relation  holds  for  the  y-component. 

Due  to  the  perfect  conductivity  the  total  tangential  E  field  is  zero  at  the  conductors.  This 
is  written  as 


=  E* 
\aEy  =  Ey 


(13) 


where  Ex  is  shorthand  for  Ex(s,  t)  +  Elx  where  Elx  is  the  complex  amplitude  of  the  x- 
component  of  the  incident  E  field  in  the  FSS-layer,  and  is  a  function  that  is  1  in  the 
apertures  and  0  otherwise. 

Another  relation  that  must  be  fulfilled  is  that  the  current  is  0  in  apertures  which  can  be 
written  as 

%.miarlB,+rvr%)  =  (14) 

i,„HYxyr'~Ex+Yyyr'Ey)  =  ^(Oi+J£°4) 

where  7  denotes  the  Fourier  transform  defined  in  (11)  and  7~l  its  inverse  defined  in  (12). 
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6  TRUNCATION  IN  SPECTRAL  DOMAIN 

So  far  we  have  derived  an  equation  with  infinitely  many  unknowns.  In  order  to  get  an 
approximate  solution  we  have  to  truncate  in  the  spectral  domain  and  discretize  in  space 
domain. 

Discretization  in  space  domain  by  s  =  m/N  and  t  =  n/N  render  a  periodic  spectral 
domain  with  periodicity  defined  by  Nk\  and  Nk\ ,  thus  is  it  sufficient  to  specify  the  Floquet 
modes  in  a  unit  cell.  For  hexagonal  gratings  the  usual  approach  is  to  choose  a  rhombic  unit 
cell,  but  this  choice  destroys  the  symmetry  that  might  exist.  Thus  it  is  preferable  to  choose 
a  hexagonal  one  as  in  Figure  3. 


Figure  3.  A  hexagonal  (a)  and  a  rhombic  (b)  unit  cell  in  the  same  lattice 

A  simple  reordering  of  the  selected  Floquet  modes  makes  it  possible  to  perform  the  Fourier 
transforms  with  the  FFT  algorithm. 

The  discretized  version  of  t,a  contains  only  the  values  0  and  1,  which  means  that  the 
multiplications  with  represent  a  selection  of  those  points  that  lies  in  the  aperture.  Thus 
the  effective  size  of  the  system  is  reduced. 

The  resulting  linear  system  is  still  quite  large  (typically  2000  unknowns  with  N  =  50  and 
an  aperture  area  of  40%)  and  the  matrix  multiplications  are  fast  so  the  system  is 
preferably  solved  with  an  iterative  method.  Our  choice  was  the  biconjugate  gradient 
stabilized  method  (bicgstab  [3]),  since  it  in  most  cases  showed  a  rapid  convergence. 


7  CODE  VERIFICATION 

In  order  to  verify  the  code  it  was  tested  against  specific  cases  reported  in  literature.  As  an 
example,  our  results  presented  in  Figure  5.  shows  a  good  agreement  with  those  by 
Vardaxoglou  [1]  presented  in  Figure  4.  The  deviations  seen  are  due  to  the  fact  that  our  code 
uses  far  more  Floquet  modes  (approx.  2500)  compared  to  169  used  by  Vardaxoglou. 
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Figure  4.  Results  according  to  Vardaxoglou  for  a  grating  with  aperture  dipoles  on  a 
dielectric  substrate. 


Figure  5.  Results  from  our  code  for  the  same  geometry 


8  CONCLUSIONS 

We  have  a  code  which  accepts  generally  shaped  elements  in  multiple  layers.  It  generates 
results  that  agree  well  with  previously  reported  results.  Drawbacks  are  long  execution 
times  due  to  the  interpreting  nature  of  matlab.  Future  research  will  be  on  finding  a 
preconditioner  to  increase  the  convergence  rate  and  to  convert  the  program  into  a  compiled 
code  on  parallel  machines. 
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1.  INTRODUCTION 

This  paper  presents  an  efficient  method,  based  upon  the  reciprocity  theorem,  for  calculating  the  radiation  patterns  of 
conformal  patch  antennas  on  circularly  cylindrical  platforms  when  the  current  distribution  of  the  patch  antenna  is  available 
from  previous  computations.  This  approach  does  not  require  the  evaluation  of  complicated  layered  media  Green  s 
functions  for  curved  geometries.  It  also  circumvents  to  the  need  to  calculate  time  consuming  Sommerfeld  integrals. 
Furthermore,  this  method  is  directly  applicable  to  patch  antennas  of  arbitrary  shape. 

The  analysis  of  conformal  patch  antennas  has  evoked  much  interest  over  the  past  few  years  because  of  their  inherent 
advantages  for  a  wide  range  of  applications  [1-7].  Patch  antenna  implementations  are  highly  desirable  on  military  manned 
and  unmanned  aircraft  due  to  their  ability  to  aerodynamically  conform  to  the  shape  of  fuselages.  In  die  commercial  sector, 
patch  antennas  are  increasingly  being  used  in  mobile  communications  systems.  These  and  other  applications  are  driving  the 
requirement  for  highly  accurate,  fast  and  efficient  techniques  for  evaluating  conformal  antenna  performance  characteristics. 

2.  THEORETICAL  DEVELOPMENT 

A.  Domain  Decomposition  and  Reciprocity  Approach 

Consider  a  patch  antenna  backed  by  a  dielectric  substrate  and  mounted  on  a  circularly  cylindrical,  perfectly  conducting 
body  as  illustrated  in  Figure  1.  The  dielectric  constant  and  thickness  of  the  substrate  material  are  denoted  by  £r  and  h 
respectively.  It  is  well  understood  that  the  current  distribution  on  the  surface  of  a  flat  cavity  backed  patch  antenna  is  not 
appreciably  altered  by  the  introduction  of  a  moderate  degree  of  curvature  in  the  structure.  Hence,  for  many  problems  of 
practical  interest,  sufficient  accuracy  is  achieved  by  solving  for  the  current  distribution  on  the  surface  of  an  equivalent  flat 
patch  rather  than  the  original  curved  one  -  curvature,  of  course,  adds  considerable  complexity  to  the  problem.  Considering 
a  flat  vice  curved  patch  has  the  additional  advantage  in  that  several  commercial  Method  of  Moments  (MoM)  codes  are 
available  that  can  be  used  to  efficiently  analyze  flat  patch  antenna  configurations. 

Once  the  current  distribution  Js  has  been  determined,  we  can  apply  the  reciprocity  theorem,  in  a  manner  explained  below,  to 
efficiently  calculate  the  radiation  pattern  of  the  patch  antenna  on  the  circularly  cylindrical  platform  operating  in  the  receive 
mode.  The  first  step  in  the  reciprocity  approach  is  to  consider  TEZ  -  and  TM2  -  polarized  plane  waves  incident  upon  a 
dielectric-coated  metallic  cylinder  and  to  solve  for  the  total  electric  field  Et,  produced  on  its  surface  by  using  analytical 
techniques.  The  methodology  of  this  procedure  is  illustrated  in  Figure  2.  The  final  step  involves  carrying  out  the  required 
dot  product  between  the  surface  current  Js  and  the  total  electric  field  Et  and  integrating  this  result  over  the  surface  of  the 
patch. 

B.  General  Far- Field  Electric  Field  Expressions 

Using  the  Hertz  vectors,  an  electromagnetic  field  can  be  expressed  in  the  following  way  [8] 
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£  =  VxVxri-  icoiNxfV 

-  _  _  -  _  (l) 

H  =  ;7yfVxn  +  VxVxn‘ 

where  II  is  the  electric  Hertz  vector  and  FI*  is  the  magnetic  Hertz  vector. 

Beginning  with  the  Hertz  vectors  and  following  the  procedure  outlined  in  Section  A  leads  to  the  following  generalized  far- 
field  expressions  for  the  radiation  pattern 

/j=-oo 


Et(r’0)=~M 


,-JAr 


Aw  [Pp^ 

n=-«  L 

-p^ejj’Sriy^pp'pln,^ 


where  the  expressions  that  characterize  a  general  patch  current  distribution  are: 

Sz{n,e)  =  JJ  Jl{(p\z')e~in¥e-j^z'dfdz\  S,(n,6)  =  JJ  J ^  ,z!)e^  e^1' d^'dz\ 

5'  S' 

and  the  coefficients  are  given  by 

K  =4x]{AKfl '  A,2ft}-  K  =^fe]{A"*2 '  A,2Si}’ 

which  contain  the  following  parameters: 


(3) 


(4) 


(5) 
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a„  =B<2>'(^)-e/|^Y^-W)(^/),  Au  =A  2l  =  rX2W)’ 


\Pp\  )\Fl) 


A»  1  1-COS2  0 


&>.J  Vr-cos2 


The  variables  a  and  h  represent  the  inner  and  outer  radii  of  the  coated  cylinder,  respectively,  and  S  denotes  the  area  of  an 
arbitrarily-shaped  conformal  patch  located  on  the  outer  surface  of  the  circular  cylinder. 

C.  Azimuthal  Plane  Radiation  Pattern  Expressions 

A  special  case  of  particular  interest  is  the  set  of  far-zone  expressions  for  the  radiation  patterns  in  the  azimuthal  plane 
(6  =  90°): 


r,0)  =  jTh>P<p-~T—~  1  r[j,(Pfi)+cnHm(M^{n,90y->, 


£,M)=->0^r  s  jj/;(A*WX(2)  (Mjs,(«.90°yrt 

where  the  surface  integrals  involving  the  patch  current  distributions  are  now: 

S^n, 90°)  =  JJ  JifXy^dfdz',  Sf(n,90°)  =  jj  Jt(f,z')e~Mdl/l’dz', 


and  the  coefficients  reduced  to 
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c< = A<  iJMWLzJhMMEL  d‘ = rK  JJMltJhllMML 

4^rH[?\PJb)FyHW\pob)F'n  ’  "  4^Hi2]\p0b)Fn2 - H^{p0b)F* 


(9) 


where 


Yn{P  Ia) 

y,  (M 


y.  (M 


(10) 


and 


A  -  fi^Tr 


(11) 


D.  Application  of  Technique 

To  illustrate  the  analysis  procedure  we  consider  a  half- wavelength  axial  patch  (l  =  A0  /  2)  with  w  =  A0 12  mounted  on  a 
foam  substrate  (fr  =  l).  This  type  of  patch  configuration  was  originally  investigated  in  [8],  where  the  current  was  assumed 
to  be  of  the  form 


J,  = 


\  *  L  ( a  \  K,  ,  M>  W 

a  —co$(fi0zl -<Z<  —  and - <  — 

z  w  KHo  ’  4  4  2b  2b 


0, 


elsewhere 


(12) 


where  w  is  the  width  of  the  patch  along  the  arc  as  illustrated  in  Figure  3  and  the  current  distribution  represented  by  (12)  is 
as  shown  in  Figure  4. 

In  this  case,  closed-form  representations  for  the  integrals  in  (8)  may  be  found  which  are  given  by 


Sz(n,90°) 


24 

.  nw 
sin  — 

U  b) 

fit 

tyn.  90°)  =  0 


(13) 


Substituting  the  above  results  into  (6)  and  (7)  yields  the  following  fax-field  expressions  for  the  patch  antenna: 


.  ( nw 
Sin  2& 


nw 

2b 


(14) 
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£,M=o 


(15) 


Finally,  (14)  may  be  expressed  in  a  more  convenient  form  given  by 


= jvoh^pnfn”)^)  -7^ 

\2b, 


1,  n  =  0 

2,  n  >  0 


is  Neuman’s  number  and 

tM  (18) 

()=  /*f(A<0 

Equation  (16)  may  be  compared  directly  with  (30a)  from  [9]. 

E.  MoM  Calculation  and  Technique  Validation 

To  further  demonstrate  the  efficiency  and  validity  of  the  Reciprocity  approach  we  next  consider  an  example  that  uses  a 
numerical  method  to  arrive  at  the  current  distribution  Js  on  the  surface  of  the  patch.  Although  there  are  sewral 
commercially  available  method  of  moments  codes  for  analysis  of  microstrip  patch  antennas,  we  chose  to  use  ENSEMBLE 
to  model  the  patch  antenna  described  in  the  preceding  section.  The  computed  current  distributions  are  illustrated  m  Figures 
5  and  6  Figures  7  through  10  plot  the  normalized  radiation  patterns  in  the  azimuthal  plane  produced  by  this  patch  antenna 
for  a  fixed  value  of  h  -  0.1  Xo  and  different  radii  a/X*  of  the  metallic  cylinder.  The  plots  clearly  show  close  agreement  of 
results  thus  providing  a  validation  of  the  reciprocity  approach  presented  in  this  paper. 
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Figure  1.  Conformal  Patch  Antenna  Mounted  on  a  Circularly-Cylindrical  Platform. 


Figure  2.  Application  of  Reciprocity  Theorem. 
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Radiation  Patterns  for  the  Half-Wavelength  Axial  Patch  Antenna 
Mounted  on  a  Circulariy-Cylindrical  Platform  (9  =  90,  -180  <  if  <  180) 


Casel: 

w/Xo=0.S 

aAo=0.5 

hO^O.05 


Radiation  Patterns  for  the  Half-Wavelength  Axial  Patch  Antenna 
Mounted  on  a  Circularly-Cylindrical  Platform  (6  =  90,  -1 80  £  $  <  1 80) 


Figure  8 


Case  2: 
w/Xo=0.5 
aA0=1 .0 
hAosO.05 


Radiation  Patterns  for  the  Half-Wavelength  Axial  Patch  Antenna 
Mounted  on  a  Circularly-Cylindrical  Platform  (6  =  90,  -1 80  £  $  £  1 80) 


Case  3; 


aA^.O 

hA„=0.05 


Radiation  Patterns  for  the  Half-Wavelength  Axial  Patch  Antenna 
Mounted  on  a  Circularly-Cylindrical  Platform  {8  =  90,  -180  £  $  £  1 80) 


Case  4: 
wA,=0.S 
a/lo=20.0 
hA^nO.05 
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Abstract —  This  paper  addresses  2D  and  3D  hierar¬ 
chal  Ho(curl)  and  Hi  (curl)  vector  finite  elements  for  high- 
frequency  computational  electromagnetics,  in  particu¬ 
lar  for  eigenvalue  problems.  Some  background  is  given 
on  vector  finite  elements.  Ho  (curl)  and  Hi  (curl)  elements 
are  given  for  both  2D  and  3D.  Results  of  waveguide 
eigenvalue  analyses  for  circular,  single-  and  dual-ridged 
waveguides  are  presented.  Theoretical  results  for  the 
explicit  matrix  entries  for  the  3D  hierarchal  Hi(curl)  el¬ 
ement  are  presented. 

Keywords —  Hierarchal  vector  finite  elements;  compu¬ 
tational  electromagnetics. 

I.  Introduction 

Vector  finite  elements  (FE)  have  almost  entirely  dis¬ 
placed  scalar  functions  for  the  finite  element  analy¬ 
sis  of  high-frequency  electromagnetic  fields.  Because 
the  original  zero  /first  order  elements  replaced  the  tra¬ 
ditional  nodal-based  field  components  with  tangential 
fields  on  the  element  edges  as  the  degrees  of  freedom, 
these  have  also  been  widely  known  as  edge-based  ele¬ 
ments;  for  higher-order  elements,  there  are  also  face- 
and  body-based  degrees  of  freedom.  Other,  historical 
names  are  Whitney/Nedelec  elements. 

In  1980,  Nedelec  published  a  very  significant  and 
widely-referenced  paper  which  introduced  these  ele¬ 
ments  for  FE  computations  [1].  The  paper  remains 
formidable  reading;  Silvester  and  Ferrari  summarise 
the  situation  well  as  follows;  “Since  1980,  the  ideas 
contained  in  Nedelec’s  paper  have  gradually  filtered 
into  the  practical  finite  elements  community,  particu¬ 
larly  amongst  those  concerned  with  electromagnetics” 
[2,  p.295].  Amongst  the  earlier  workers  to  use  vector- 
based  finite  elements  were  Bossavit,  Cendes,  Crowley 
et  al.,  and  Lee  et  al.;  references  to  these  and  other  pi¬ 
oneers  may  be  found  in  Webb’s  excellent  review  paper 
[3]. 

Our  paper  addresses  both  2D  and  3D  vector  FE 
analyses  of  eigenvalue  problems,  using  higher-order 
hierarchal  elements.  Although  2D  problems  can  be 
worked  as  scalar  TE  or  TM  problems,  it  is  instructive 
to  study  the  properties  of  vector  elements  in  2D,  which 
motivated  the  2D  work  to  be  presented.  The  eigenvalue 
analysis  of  waveguiding  structures  will  be  reported,  in¬ 


cluding  an  analysis  of  single  and  dual-ridged  waveg¬ 
uide;  results  will  be  presented  and  compared  to  a  ven¬ 
erable  approximate  analysis.  The  paper  concludes  with 
theoretical  results  giving  explicit  formulas  for  the  ma¬ 
trix  entries  for  a  3D  second  order  hierarchal  element. 

As  will  be  discussed,  higher-order  vector  elements 
are  not  unique,  and  the  two  major  choices  facing  im¬ 
plementors  are  whether  to  adopt  hierarchal  or  interpo- 
latory  elements.  Savage’s  work  indicates  that  the  latter 
may  be  preferable  from  the  viewpoint  of  matrix  con¬ 
dition  number  [4],  and  Graglia  et  al  have  published 
a  systematic  approach  to  deriving  higher-order  inter- 
polatory  elements,  including  curvilinear  elements  [5]. 
However,  hierarchal  elements  remain  very  attractive 
for  research  codes  using  commercial  mesh  generators, 
where  mesh  refinement  is  simpler  by  enriching  element 
order  (p-adaptation)  as  opposed  to  h  adaptation.  (A 
discussion  of  error  estimates  and  h—p  mesh  adaptation 
may  be  found  in  [6],  [7]). 

II.  TWO  DIMENSIONAL  VECTOR  FINITE  ELEMENTS 
A.  Hierarchal  elements 

The  simplest  basis  functions  are  the  constant  tan¬ 
gential  /  linear  normal  (CT/LN)  elements.  It  is  by 
now  well  known  in  the  computational  electromagnet¬ 
ics  community  that  vector  finite  elements  are  of  mixed 
order;  many  authors  use  fractional  order  to  indicate 
this,  and  such  elements  are  often  referred  to  as  of  or¬ 
der  one-half;  an  alternate  nomenclature  is  H0(curl). 
(This  is  an  extension  of  Nedelec’s  notation).  There  is 
no  debate  over  the  lowest  order  element:  the  degrees  of 
freedom  of  the  Ho  (curl)  element  are  the  line  integrals 
of  the  tangential  field  along  each  edge.  (Since  this  is 
a  constant  times  the  edge  length,  the  edge  length  is 
sometimes  dropped  in  the  literature  [4]).  The  same  is 
not  so  for  higher-order  elements.  The  reason  is  that  the 
element  must  satisfy  what  have  become  known  as  the 
Nedelec  constraints;  these  are  contained  in  the  defini¬ 
tion  of  the  linear  space  7Lk  [1,  Definition  2].  The  basis 
functions  must  satisfy  these  constraints  [1,  Definition 
4].  Higher-order  elements  permit  several  choices  of  ba¬ 
sis  function  satisfying  these  criteria. 
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Fig.  1.  Edge-based  LT/QN  functions  and  the  edge  directions 
for  a  triangular  element.  These  are  the  directions  of  the 
basis  functions  on  the  element  edges  only.  Reprinted,  with 
permission,  from  [8]  ©1998  IEEE. 

When  a  more  accurate  approximation  of  the  field 
variable  is  required,  the  linear  tangential  /  quadratic 
normal  (LT/QN)  (.Hi  (curl))  shown  in  Table  I  may  be 
used.  This  is  a  hierarchal  element;  it  contains  the 
three  Ho  (curl)  edge-based  functions,  which  is  then  “en¬ 
riched”  using  three  additional  edge-based  and  two  ad¬ 
ditional  face-based  functions,  resulting  in  eight  degrees 
of  freedom.  These  last  functions  have  no  tangential 
components  on  any  of  the  edges,  in  this  way  not  af¬ 
fecting  tangential  continuity  between  elements  which 
was  established  by  enforcing  tangential  continuity  for 
the  first  six  basis  functions.  However  these  face  func¬ 
tions  add  normal  components  to  two  of  the  three  edges, 
which  introduces  the  quadratic  normal  components. 
Note  that  the  choice  of  which  edge/face  combination 
to  use  is  arbitrary. 


Order 

Basis  functions  i 

CT/LN 

wk  = 

AiVAj  -  Xj  VAi 

k  =  1,2,3 

(  AiVAj+AjVAi 

k  =  4,5,6 

LT/QN 

wk  = 

^  (VAj)Aj-Afc 

k  =  7 

l  (VAjJAiA* 

k  =  8 

TABLE  I 

Hierarchal  2D  basis  functions.  Note  that  the  LT/QN 
ELEMENTS  INCLUDE  THE  CT/LN  FUNCTIONS. 


B.  FE  Eigenvalue  Analysis 
The  FE  analysis  of  an  eigenvalue  problem  results  in 
the  following  generalised  eigenmatrix  equation: 

[S][e)  =  k*[T][e]  (1) 

The  mathematical  process,  and  the  notation,  have 
been  comprehensively  described  previously:  a  conve¬ 
nient  summary  may  be  found  in  [9].  The  resulting 


Exact  TE 
wavenumbers 

CT/LN 

result 

LT/QN 

result 

1.841 

1.8468;  1.8468 

1.8399;  1.8450 

3.054 

3.0641;  3.0641 

3.0435;  3.0538 

3.831 

3.8404 

3.8169 

4.19 

4.2093;  4.2093 

4.1760;  4.1808 

5.317 

5.3405 

5.2348;  5.2556;  5.2716 

5.331 

5.3470;  5.3470 

5.3318 

TABLE  II 

The  first  six  TE  modes:  exact  and  calculated 

WAVENUMBERS  FOR  CIRCULAR  WAVEGUIDE  RADIUS  lM.  THE 
GUIDE  CROSS-SECTION  WAS  DISCRETISED  INTO  200  ELEMENTS. 


eigenvalues  for  a  2D  problem  (with  PEC  boundaries) 
are  the  eigenvalues  of  the  waveguide  modes,  i.e.  the 
cut-off  frequencies  of  each  mode. 

B.l  Circular  waveguides 

A  code  developed  by  the  authors  was  applied  to  cal¬ 
culate  the  cutoff  frequencies  of  propagating  modes  in 
circular  cylinders  of  infinite  length  and  radius  1  m,  for 
both  the  TE  and  TM  modes.  The  mesh  generation  was 
done  by  the  use  of  a  Delaunay  triangulation  function 
available  in  Matlab  Version  5. The  accuracy  of  CT/LN 
and  LT/QN  is  examined  by  comparing  the  values  cal¬ 
culated  with  the  FEM  technique  with  the  exact  cutoff 
frequencies.  From  Table  II  it  can  be  seen  that  the 
LT/QN  solutions  are  indeed  more  accurate  than  the 
CT/LN  solutions.  The  multiplicity  of  eigenvalues  has 
also  been  reported  by  another  worker  [10] . 

B.2  Ridged  Waveguide 

Ridged  waveguides  are  known  to  lower  the  cutoff  fre¬ 
quency  of  the  TEio  mode  while  increasing  the  cutoff 
frequencies  of  the  higher-order  modes,  resulting  in  a 
larger  single-mode  bandwidth.  This  is  especially  use¬ 
ful  for  wideband  horn  feeds  for  microwave  antennas. 
Possible  configurations  are  single-ridged  (Fig.  2),  dual- 
ridged  (Fig.  3)  and  quadruple-ridged  waveguides  [11, 
p.  457].  We  show  here  applications  of  our  code  to 
dual-  and  single-ridged  waveguides;  to  the  best  of  our 
knowledge,  this  particular  application  is  novel,  albeit 
obvious. 

B.3  Dual-ridged  Waveguide 

The  waveguide  parameters  were  b/a  =  0.45,  d/b  = 
0.5  and  s/a  =  0.4.  The  cross-section  was  discretised 
into  a  uniform  mesh  of  64  triangular  elements. 

No  analytical  solution  is  available  for  this  type  of 
waveguide,  but  reliable  approximations  are  available 
for  the  cutoff  frequency  of  the  first  mode  [12].  These 
results  have  a  maximum  error  between  1  and  3  percent 
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Pyle’s  TE  cutoff 
wavelength  (^) 

CT/LN  result 
(*) 

LT/QN  result 
(¥) 

2.652 

2.7338 

2.664 

- 

1.0718 

1.0200 

- 

0.8816 

0.8772 

0.8807 

0.8764 

- 

0.6625 

0.6448 

TABLE  III 

The  TE  modes:  Pyle’s  approximation  for  TEio  cutoff 

WAVELENGTH  (NORMALISED  BY  a )  AND  FEM  RESULTS  FOR  A 
DUAL-RJDGED  WAVEGUIDE. 


Pyle’s  TE  cutoff 
wavelength  (^) 

CT/LN  result 
(¥) 

LT/QN  result 
(¥) 

2.785 

2.8318 

2.7914 

- 

1.1983 

1.1666 

- 

0.8806 

0.8742 

- 

0.7991 

0.8001 

- 

0.656 

0.6433 

TABLE  IV 

The  TE  modes:  Pyle’s  approximation  for  TEio  cutoff 

WAVELENGTH  (NORMALISED  BY  a)  AND  FEM  RESULTS  FOR  A 
SINGLE-RIDGED  WAVEGUIDE. 


for  the  worst  case  where  s/a  «  1-  The  results  are 
compared  in  Table  III. 


s 


..i 


a 

Fig.  2.  Dual-ridged  waveguide. 


B.4  Single-ridged  Waveguide 
The  waveguide  parameters  were  b/a  =  0.45,  d/b  = 
0.5  and  s/a  =  0.4.  The  cross-section  was  discretised 
into  a  uniform  mesh  of  128  triangular  elements. 

Once  again  the  result  for  the  TEio  is  given  in  [12]. 
The  results  are  compared  in  table  IV . 


s 


b 


a 

Fig.  3.  Single-ridged  waveguide. 


III.  Three-dimensional  vector  finite 

ELEMENTS 

Whilst  defining  the  higher-order  elements  is  essen¬ 
tial,  explicit  forms  for  the  contributions  of  the  ma¬ 


trices  derived  from  the  variational  functional  are  ex¬ 
tremely  useful  when  implementing  a  FE  code.  Lee 
and  Mittra  published  very  convenient  expressions  for 
ffo(curl)  elements  [9];  the  first  author  recently  ex¬ 
tended  this  work  [13].  Savage  and  Peterson  presented 
a  more  succinct  approach  in  [14];  they  included  ele¬ 
ments  up  to  #2  (curl)  (Quadratic  tangential  /  cubic 
normal)  and  gave  explicit  forms  for  the  ifo(curl)  and 
Hi  (curl)  elements.  Their  elements  are  not  hierarchal, 
and  were  also  not  constructed  along  a  specific  inter- 
polatory  scheme.  Savage  went  on  to  publish  both  in- 
terpolatory  and  hierarchal  vector  basis  functions  up  to 
and  including  fl^curl)  [4],  and  Peterson  went  on  to 
publish,  with  Graglia  and  Wilton,  a  full  interpolatory 
scheme  for  both  curl-curl  and  div-div  elements  [5].  Nei¬ 
ther  of  these  papers  contained  explicit  formulas  for  the 
matrix  entries.  Another  set  of  hierarchal  elements  has 
also  been  recently  proposed  [15];  we  use  those  of  [4]  in 
the  present  paper. 

A.  Explicit  forms  for  3D  hierarchal  .Hi  (curl)  elements 

This  paper  will  now  present  explicit  forms  for  the 
J?i(curl)  interpolatory  elements,  given  in  element  form 
in  [4].  Despite  the  significantly  improved  convergence 
rate  of  higher-order  elements,  the  complexity  grows  so 
rapidly  that  a  user-oriented,  general-purpose  code  is 
arguably  unlikely  to  routinely  use  more  than  Hi  (curl) 
elements;  the  degrees  of  freedom  grows  as  [1,  p.323]: 

Nk  =  fc(k  +  2)(k  +  3)  (2) 

k  here  is  for  #fc_i(curl).  This  is  summarised  in  Ta¬ 
ble  V,  where  the  rapid  growth  in  degrees  of  freedom 
associated  with  the  higher-order  elements  is  readily  ap¬ 
parent. 

Tables  VI  andVII  summarise  the  higher-order  hi¬ 
erarchal  CT/LN  and  LT/QN  vector  basis  functions. 
This  is  based  on  [4,  Table  2].  Note  that  what  is  pre¬ 
sented  are  the  functions  for  one  edge,  one  face  and 
the  interior  tetrahedron  function.  The  overall  system 
replicates  this  for  each  of  the  six  edges  and  four  faces. 
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Order 

Degrees  of  freedom 

Fo(curl) 

6 

Hi  (curl) 

20 

H2(curl) 

45 

H3(curl) 

84 

#4  (curl) 

140 

TABLE  V 

Hierarchal  vector  BASIS  FUNCTIONS:  degrees  of  freedom 
ASSOCIATED  WITH  VARIOUS  ORDERS  OF  ELEMENT. 


Order 

Edge 

functions 

Face 

functions 

Tetra¬ 

hedron 

functions 

Bo  (curl) 
(CT/LN) 

AjVA2  -  A2VAj 
=  fli2  (==  el) 

Hi  (curl) 
(LT/QN) 

V[AiA2J(=e2) 

Ai^23  (=  fl) 
A2Oi3  (=  /2) 

TABLE  VI 

Hierarchal  vector  basis  functions:  explicit  forms.  Note 
THAT  THE  LT/QN  ELEMENTS  include  THE  CT/LN  functions. 


Explicit  forms  are  required  for  the  following  inte¬ 
grals,  once  the  relevant  basis  functions  are  substituted 
for  Bi  and  Bj  in  the  following: 

Ei:j  =  [  V  xBi-V  xBj  (3) 

Jv 

and 


Fa  =  f  St-  Bj  (4) 

Jv 

(The  symbols  [5]  and  [T]  respectively  are  also  widely 
used  for  these  matrices  [2],  [9]).  Savage  and  Peterson 
published  explicit  forms  for  a  closely  related  Hi  (curl) 
element;  we  have  extended  their  work  and  present  the 
results  here.  The  first  six  elements  in  Table  VI  are 
precisely  as  for  Ho  (curl)  —  because  of  course  of  the 
hierarchal  nature  of  the  elements.  However,  the  next 
six  edge  functions  and  eight  face  functions  are  new. 


Order 

Edge 

functions 

Face 

functions 

Tetra¬ 

hedron 

functions 

H0  (curl) 

o> 

X 

II 

— 

— 

Hi  (curl) 
additional 

6x1  =  6 

4x2  =  8 

TABLE  VII 

Hierarchal  vector  basis  functions:  extra  unknowns 

ADDED  PER  ADDITIONAL  ORDER 


The  face  functions  are  as  for  the  elements  tabulated 
in  [14].  The  edge  functions  however  are  different,  and 
the  explicit  forms  must  now  be  derived.  Note  that  the 
[E]  and  [F]  matrices  consist  now  of  several  types  of 
interaction,  between  the  various  element  functions. 

Consider  first  the  curl-curl  term,  eqn.  (3).  Following 
the  notation  of  [14],  the  interactions  can  be  expressed 
as  block  matrices  as  follows: 


E  = 


Eelel 

Ee2el 

Efiel 

Ef^el 


£e  le2 
£e2e2 
£/l*2 
£/2e2 


£elf  I  jjel/2 
££2/1  £62/2 
£/l/l  £/ 1/2 

£/2/l  £/2/2 


(5) 


A  similar  expression  pertains  for  [F].  Note  here,  how¬ 
ever,  that  el  and  e2  are  the  first  and  second  six  func¬ 
tions  of  Table  VI,  rather  than  functions  of  the  form 
AjiVA,2  as  originally  proposed  in  [4]. 

From  eqn.  (3),  it  is  clear  that  the  elements  of  eqn.  (5) 
are  symmetrical,  hence  only  the  sub-matrices  on  and 
above  (or  below)  the  diagonal  need  be  explicitly  eval¬ 
uated.  The  term  Eelel  is  as  for  the  CT/LN  element 
and  was  evaluated  in  [14,  eqn.  10].  The  additional  edge 
functions  of  the  form  V(AjAj)  (e2  in  the  above)  make 
no  contribution,  since  V  x  V<f>  =  0  V</>.  This  is  true 
for  all  terms  involving  the  e2  functions.  Thus  the  fol¬ 
lowing  elements  are  zero:  Eele2  =  Ee2*1  —  Ee2f2  =  0 
(and  of  course  the  corresponding  elements  below  the 
diagonal). 

Now,  the  contribution  of  the  face  elements  to  [F] 
must  be  established.  Using  some  elementary  vector 
identities,  grouping  like  terms  in  the  same  fashion  as 
[14,  eqn. 10],  using  [14,  eqn.24]  and  finally  the  formulas 
for  integration  over  tetrahedra  [14,  eqn.  14],  one  obtains 
the  following  results: 


Etr 

= 

4V£iijVii'i2  ■  VJ1J2 

(6) 

Etr 

= 

0 

(7) 

Etjfl 

= 

~^tiVil,i2  •  {2Vj2,j3  +  Vjlj3  ~  V7IJ2] 

(8) 

E^fZ 

v 

= 

-^UVi\,i2  '  \Pj2,jZ  +  2vjl,j3  +  Vjl,j2] 

(9) 

= 

0 

(10) 

Et!fl 

= 

0 

(11) 

Etr 

= 

0 

(12) 

Eir 

= 

V (4Jkfji  ji  Vt2,t3  ■  Vj2,}3  +  2Miiij2U«2,j3 

•  VjljS 

—  4JyJil,j3Vi2,i3  jl  </»l,t3  •'jZ, 

+Afj2,j2Vil,i3  •  U/1,/3  “  •M»2,/3W*l,i3  •  Vj\j2 
-2Mi3'jlVil42  ■  Vj2,j3  -  Mi3,j2Vil,i2  '  Vjl,j 

+Mi3J3VU'i2  ■  Vjij*)  ( 
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=  V(2MnjlVi2,iS  ■  VJ2J3  +  4MilJ2^2,i3  ■  VjlJS 
+2Mn,j3Vi2,i3  ■  VJ1J2  +  Mi2,jlVn,i3  ■  Vj2,j3 
+2Mi2,j2Vil,i3  '  Vjl,j3  +  M{2,j3Vil,i3  '  Vjl>j2 
-Mi3tjiVn,i2  ■  Vj2,j3  ~  2Mi3,j2Vil,i2  ■  Vjl,j3 
-Mi3tj3Vn,i2  -Vjl,j2)  (14) 

E(jf~  =  V(Mn,j\Vi2ti3  ■  Vj2,j3  +  2Miij2Vi2,i3  ■  VjlJS 
+Mn,j3Vi2,i3  •  Vji,j2  +  2Afajltrii,a  '  Vj2,j3 
+4Mi2,j2^»l,i3  •  Vjl,j3  +  2Mi2,j3^il,t3  '  ®#lj2 

+A/i3JlUil,»2  •  Wj2  jS  +  2M« ^2Wil,t2  •  VjflJS 

+Afi3,y3Wtl,i2  •  Vjl,i2)  (15) 

Terms  such  as  AfiJt  vitj  etc.  are  as  given  in  [14]. 1 The 
results  involving  face  elements  only  (the  last  three  sub¬ 
matrices  above)  are  identical  to  [14,  eqns.(31,34  &  35)]. 
They  have  been  double-checked  by  the  present  author 
and  are  correct  as  published.  The  results  involving 
edge  elements  above  differ  from  [14],  in  some  cases  sub¬ 
tly  so. 

The  contribution  to  the  div-div  term,  eqn.  (4),  is 
straightforward  albeit  tedious  to  derive.  The  result  is: 


F^l'e2  respectively,  and  similarly  for  the  other  such 
expressions. 

The  results  for  the  face  element  submatrices  (the  last 
three  above)  are  identical  to  [14,  eqns.(41,44  Sc  45)]; 
again,  these  results  have  been  checked  by  the  present 
authors  and  are  correct  as  given.  fcj  is  defined  in  [14, 
eqn.(8)]. 

The  integration  matrices  and  Pi,j,k,i  are  as  de¬ 
fined  in  [14,  eqns.(46  Sc  47)];  these  are  not  quite  as 
straightforward  as  that  of  the  Hi  (curl)  elements  [14, 
eqn. (15)].  The  reason  is  that  the  matrix  entries  are 
in  turn  sub-matrices,  since  each  face  i  has  three  nodes 
(and  hence  three  simplex  coordinates)  associated  with 
it.  As  an  example,  the  sub-matrix  for  faces  i  =  2  and 
j  =  3,  (with  associated  nodes  {1,2,4}  and  {1,3,4} 
respectively,  using  the  numbering  convention  of  [14, 
Table  II])  is  as  follows: 


[^2,3]  - 


pe2/el,el/e2 


— +  <f>il,j\Mi2,j2}  (16) 

Vlilj[<j>i2J2Mii,jl  T  &2jlAftt J2 

32^12,3!  ~  <Pil,jlMi2,j2]  (17) 


1 

840 


Pl,2,l,3  -Pi, 2, 1,4  Pl,2,3,4 

Pi, 4,1, 3  Pi, 4, 1,4  Pi  ,4,3,4 

P2,4,l,3  P2,4,l,4  p2,4,3,4  J 

2  2  1 


2  4  2 
1  2  2 


(24) 


Such  matrices  can  be  either  pre-computed,  or  com¬ 
puted  on  the  fly  by  inspection. 

IV.  Conclusions 


pe2  ,e2 

=  V£iij[<l>i2,32MilJl  +  0»2,jlAf»l,j2 

+<f>HJ2Mi2,jl  +  4>il,jlMi2,j2]  (18) 

F«l/2,/l 

Fij 

-  V£i(0i2,j3Nil,jl,j2  =F  0il,j3^i2,jl,j2 
-<Pi2,j2Nn,jl,j3  ±  4*il ,j2 77i2,j  1  ,j3 ) 

(19) 

Fel/2J2 

=  V ti (4>i2,j3 Nil ,jl  j'2  T  0tl,j3-Nt2,jlj2 
—<Pi2,jlNil,j2,j3  ±  4>i\,j\Ni2,j2j3) 

(20) 

r/1,/1 

y 

=  V[0i3J3-Ptl,i2,jl,j2  —  0i2,j3-ftl,«3,jl,j2 
~d>i3,j2-Pil,«2,jl,j3  +  0i2J2-P«l,i3,jl,j3] 
(21) 

r/l,/2 

=  V  [<Pi3,j3  P\l  ,i2,j  1  j’2  —  4>i2,j3Pil,i2,jl,j2 

~4>i3,jlPil,i2,j2,j3  +  &2,jl-ftl,i3,j2  J3] 
(22) 

f/2,/2 

v 

=  V[^3,j3p£l,t2,jl,j2  ~  <f>il,j3Pi2,i3,jl,j2 

~4>i3jlPil,i2,j2,j3  +  <t>il,3lPi2,i3,j2,j3] 

(23) 

Due  to  the  similarity  in  form,  the  results  for  el  and 
e2  are  presented  simultaneously  in  the  above;  the 
and  ±  in  jr'2/el-el/e2  (eqn.  (17))  refer  to  F-f'el  and 


This  paper  has  addressed  several  aspects  of  higher- 
order  vector  finite  elements,  concentrating  on  eigen¬ 
value  applications.  Two-dimensional  results  have  been 
presented  for  both  a  standard  test  problem  (a  circu¬ 
lar  waveguide) ,  as  well  as  a  novel  application  to  ridged 
waveguide.  Theoretical  three  dimensional  results  have 
been  presented,  giving  explicit  forms  for  hierarchal 
H\  (curl)  elements.  This  is  a  logical  extension  of  previ¬ 
ous  work  [14].  Such  explicit  forms  are  extremely  useful 
when  implementing  a  FEM  code. 

Progress  on  the  implementation  and  application  of 
these  3D  ffi(curl)  elements  wiU  be  reported  at  the 
symposium.  The  authors  already  have  a  working  3D 
tfo(curl)  FE  code;  results  for  cavity  eigenvalue  prob¬ 
lems  computed  using  it  were  presented  in  [13]. 
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Local  Error  Estimation  for  High-Frequency  Problems  Using 
Hierarchical  Tangential  Vector  Finite  Elements 

Scott  Savage 
John  Manges 
Ansoft  Corporation 

Abstract  -  This  paper  applies  the  method  of  local  error  estimation  to  high-frequency  finite 
element  applications.  Local  error  estimation  is  useful  in  determining  the  validity  of  a  finite  element 
solution  on  an  element  by  element  basis,  and  has  previously  been  applied  to  scalar  finite  element 
analysis.  Applying  the  method  to  high-frequency  problems  is  possible  due  to  the  recent  development 
of  high-order,  hierarchical,  vector  basis  functions.  This  paper  will  provide  a  formulation  of  the  local 
error  estimation  method  and  present  numerical  results  which  demonstrate  the  efficiency  of  the  method. 

I.  Introduction 

It  is  well  known  that  high-order  vector  finite  elements  can  efficiently  solve  high-frequency 
electromagnetic  problems.  However,  field  singularities,  which  are  present  in  most  finite  element 
applications,  can  reduce  the  convergence  rates  of  any  order  finite  element  analysis.  Furthermore, 
employing  singular  basis  functions,  which  is  common  in  2-D  problems,  is  difficult  to  automate  in  3-D 
due  to  the  unknown  rate  of  singularities  in  arbitrary  problems.  For  these  problems,  traditional  meshing 
using  a  wavelength  criterion  can  be  very  inefficient.  This  paper  applies  the  method  of  local  error 
estimation  to  high-frequency  problems.  The  method  identifies  those  areas  of  a  finite  element  mesh  in 
which  the  solution  is  poor,  so  that  some  type  of  adaptive  refinement  (n-type  or  p— type)  can  be  applied 
there. 

The  adaptive  finite  element  method  relies  on  a  good  refinement  indicator  which  quantitatively 
ranks  the  elements  (tetrahedra)  in  a  mesh  based  on  the  solution  accuracy.  Some  refinement  indicators 
previously  used  in  high-frequency  problems  include  residual  based  indicators  and  local  error 
estimation  [1,2]. 

The  technique  of  local  error  estimation  is  to  resolve,  with  increased  accuracy  and  less  expense,  the 
fields  on  subdomains  of  the  original  problem.  Under  the  assumption  that  the  solution  of  the  local 
problem  is  significantly  more  accurate,  the  difference  between  the  local  problem  solution  and  the 
original  global  solution  is  an  estimate  of  the  error  in  the  global  solution. 
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To  obtain  a  more  accurate  local  solution  on  a  subdomain,  one  can  either  use  more  dense  meshing, 
higher  order  basis  functions,  or  a  combination  of  the  two.  Sense  hierarchical  basis  functions  are 
available  [5,6],  the  second  possibility  is  more  desirable.  First,  higher  order  basis  functions  offer  better 
accuracy  per  degree-of-freedom.  Second,  a  hierarchical  solution  avoids  the  need  for  additional  mesh 
manipulation.  This  method  has  been  successfully  applied  to  scalar  problems  in  quasi-statics  [2,3]. 

n.  The  Local  Error  Problem 

As  a  model  problem,  consider  the  determination  of  the  fields  and  resonant  frequencies  for  an 
enclosed  cavity.  The  finite  element  solution  of  the  weak  form  of  the  vector  Helmholtz  equation, 

JjJVxT  -VxE -k2T -E  =  j^T  -nxVxE ,  (1) 

results  in  a  generalized  Eigenvalue  problem,  where  E  and  k  are  determined  simultaneously.  Since 
k  is  variationally  stationary,  it’s  solution  is  more  accurate  than  E .  In  the  local  error  estimation,  k  is 
assumed  to  be  exact. 

The  exact  solution,  E ,  obeys  (1).  Let  Eh  denote  the  field  corresponding  to  an  approximate  finite 
element  solution.  The  global  error  function  e  is  defined  by 

e  =  E-Eh.  (2) 

Clearly,  if  e  is  known  (or  can  be  accurately  approximated),  then  the  exact  solution  is  known.  Let  E‘ 

denote  the  approximate  solution  to  the  local  problem  for  E .  Then, 

eh  -El-Eh,  (3) 

is  an  estimate  of  e  .  It  then  follows  that 

(4> 

In  other  words,  the  error  in  estimation  of  e  by  (3)  is  equal  to  the  error  in  the  local  problem.  So,  the 
goal  is  to  solve  the  local  problem  more  accurately  than  was  done  in  the  global  solution. 

A  deterministic  local  problem,  defined  in  (1),  is  solved  on  each  tetrahedron  using  the  approximate 
boundary  fields  and  k  from  the  original  global  solution.  In  doing  this,  either  Neumann  or  Dirichlet 
boundary  conditions  could  be  used.  In  this  paper,  the  Neumann  boundary  condition  is  used  as  this 
allows  full  use  of  the  degrees-of-freedom  of  the  high-order  element. 

IE.  Hierarchical  Vector  Elements 

Vector  basis  functions  have  proven  to  be  the  best  choice  in  solving  high-frequency  finite  element 
problems  [4].  Hierarchical  vector  basis  functions  have  recently  been  extended  to  high  polynomial 
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orders  [5,6].  Table  1  presents  the  basis  functions  used  in  this  study.  The  basis  functions  are  of  the 
type  first  proposed  by  Nedelec  [7],  The  H0(curl)  functions  are  the  lowest  order  vector  functions,  often 
referred  to  as  edge-elements.  Hierarchical  functions  through  H3(curl)  are  given.  These  basis  functions 
allow  the  straight-forward  solution  of  (1)  on  the  interior  of  a  single  tetrahedron.  The  discretized 
version  of  (1)  is 

(S-*2T)bt  =  Bp,  (5) 

where  o  is  a  vector  of  unknown  coefficients,  and  P  is  a  vector  of  known  coefficients  from  the 
solution  of  the  global  problem,  augmented  by  zeros  for  the  higher-order  basis  functions  used  in  the 
local  problem.  The  matrices  S,  T,  and  B  are  the  element  matrices  for  the  tetrahedron.  Equation  (5)  is 
solved  using  full— matrix  techniques  for  a.  The  approximate  error  field  is  then  integrated  over  the 
tetrahedron  to  determine  the  refinement  indicator, 

r  =  JJ!e*'F*=(a-p)rT(a-|>).  <6> 

This  indicator  is  computed  for  all  tetrahedra  to  determine  where  adaptive  meshing  should  be  applied. 

IV.  Numerical  Results 

To  demonstrate  the  advantage  of  intelligent  adaptive  refinement,  the  method  of  local  error 
estimation  was  applied  to  the  analysis  of  a  resonant  cavity  with  reentrant  comers.  The  cavity  was 
formed  by  placing  a  small  perfectly  conducting  cube  in  the  comer  of  another  cubic  cavity.  The 
dominant  mode  of  this  cavity  possesses  a  singularity  near  the  comer  of  the  small  cube.  The 
wavenumber,  k,  of  this  mode  was  determined  to  be  3.588  on  a  well  refined  mesh.  Fig.  1  shows  the 
convergence  of  the  finite  element  method  using  H0(curi)  as  more  and  more  tetrahedra  are  applied  to 
the  problem.  The  local  error  estimation  method  using  Hi(curl)  on  the  interior  is  compared  to  uniform 
refinement  where  the  goal  is  to  keep  all  tetrahedra  approximately  the  same  size.  As  the  solution 
approaches  convergence,  it  is  clear  that  intelligent  mesh  refinement  is  necessary  for  efficient  solutions. 
Fig.  2  shows  the  same  analysis  where  Hi  (curl)  functions  were  used  globally  and  H2(curl)  locally.  Fig. 
3  uses  H2(curl)  functions  globally  and  H3(curl)  locally.  In  all  cases,  local  error  estimation  provides 
more  efficient  meshing  than  uniform  refinement. 

One  interesting  trend  appears  with  the  coarse  initial  meshes.  It  appears  that  uniform  refinement 
initially  is  more  efficient  than  intelligent  refinement.  It  may  seem  logical,  then,  to  use  uniform 
refinement  initially,  followed  by  refinement  using  local  error  estimation.  Fig.  2  presents  results  from 
applying  this  technique.  It  appears  that  this  approach  initially  provides  better  solutions  with  fewer 
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unknowns.  However,  as  the  mesh  is  further  refined,  the  local  error  estimation  method  eventually 
overtakes  the  combined  approach.  Thus,  it  seems  that  using  the  local  error  estimation  method  from 
the  initial  mesh  ultimately  provides  the  most  efficient  finite  element  mesh. 


V.  Conclusion 

In  this  paper,  the  method  of  local  error  estimation  has  been  applied  to  high-frequency  finite 
element  solution  of  electromagnetic  problems.  This  was  possible  due  to  the  recent  development  of 
high  order,  hierarchical,  tangential  vector  finite  elements.  It  was  demonstrated  that  this  method  can  be 
used  to  improve  adaptive  mesh  refinement  over  wavelength  based  mesh  refinement.  It  still  remains  to 
compare  this  approach  to  other  mesh  refinement  indicators  including  residual  based  methods. 
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Table  2. 

Hierarchical  Vector  Basis  Functions 


Order 

Edge  Functions 

Face  Functions 

Tetrahedron  Functions 

Ho(curl) 

W-W=Q„ 

N/A 

N/A 

Hi  (curl) 
Additional 

VlU] 

^1^23 

^2^13 

N/A 

H2(curl) 

2^Q23,  -^2^13  >  -^3^12 

^1^2^34 

Additional 

V[A,A,A,] 

^2^3^14 

^3^4^12 

H3(curl) 

V[A1A2(2A,-A2XA,-2A2)J 

43023  ’  ^2^13  »  -^3^12  ’ 

AjA2A3Q14,  ^^24 

Additional 

^1^3^12 

vfeAj,  v[a,.M?] 

^■1^3^4^12  ’  ^1^3^24 
■^2^3^14’  ^2^3^14 

v^a^aJ 
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Abstract 

A  finite  element  formulation  is  used  to  compute  the  time-varying  inductance  associated  with  a 
multi-conductor  transmission  line  with  saturable  magnetic  materials.  A  time  domain  finite  difference 
solution  of  the  transmission  line  equations  is  coupled  to  a  circuit  simulator  based  on  algebraic 
companion  form  equations.  The  approach  is  intended  for  the  modeling  of  transient  signals  on  power 
distribution  networks. 


Introduction 

Several  applications,  including  power  distribution  networks  [1],  depend  on  circuit  analysis 
tools,  such  as  Spice™,  in  order  to  model  their  systems.  Integrating  a  full  electromagnetic  model  into 
such  simulation  tools  requires  particular  adaptation  of  the  models’  interfaces.  Transmission  lines  form 
the  backbone  of  most  distribution  networks,  and  therefore  act  as  critical  links  in  the  simulation.  Skin 
effect  problems  have  been  studied  for  simple  multiconductor  geometries  using  the  transmission  line 
matrix  method  exclusively  [2].  In  order  to  provide  a  more  accurate  estimation  of  signal  propagation  on 
transmission  lines,  a  combination  of  time  domain  transmission  line  matrix  and  nonlinear  finite  element 
analysis  is  required.  This  allows  for  a  more  rigorous  transient  analysis  that  incorporates  geometry, 
conductivity,  and  nonlinear  ferromagnetic  effects.  The  transient  finite  element  analysis  using  surface 
impedances  has  been  achieved  via  the  fast  Fourier  transform  [3];  however,  since  circuit  simulators 
require  time  marching  analysis,  only  a  truly  transient  finite  element  solution  will  fit  [4], 

Since  the  algebraic,  or  resistive,  companion  form  of  circuit  equations  is  commonly  used,  the 
electromagnetic  model  must  produce  a  terminal  interface  that  fits  that  form.  The  dependent 
relationships  between  the  full  network  solver,  the  transmission  lines  matrix,  and  the  finite  element 
model  are  demonstrated. 

The  Transmission  Line  Matrix  Model 

Since  a  transmission  line  may  be  decomposed  into  a  simple  longitudinal  and  a  complex  cross- 
sectional  geometry,  it  is  reasonable  to  solve  a  different  set  of  equations  in  each  space.  The 
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voltages  may  be  determined  for  the  previous  currents  over  the  adjacent  segments  and  the  past  node 
voltage. 

In  a  nonlinear  system,  the  inductance  matrix  [1]  is  a  function  of  local  segment  currents.  [1]  is 
then  also  a  function  of  position  and  time.  For  a  K-segment  line,  K  separate  [1]  matrices  exist  down  the 
transmission  line  at  every  time  step,  and  are  updated  after  the  new  segment  currents  are  determined. 


The  highlighted  solution  domain  illustrates  the  operations  performed  at  each  iteration.  Using  this 
approach,  signals  propagating  in  both  directions  on  the  transmission  line  are  accounted  for. 


Interface  to  the  Algebraic  Companion  Form 

The  algebraic  companion  form  [5,6]  is  a  system  of  writing  circuit  equations  into  a  matrix  so 
that  the  interaction  of  multiple  local  components  or  modules  may  be  impressed  into  a  global  matrix 
whose  solution  is  the  terminal  parameters  (node  and  current)  of  all  the  modules  in  the  system.  This 
allows  for  the  integration  of  a  large  number  of  separate  functioning  modules  into  a  large  network 
solver. 
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Each  terminal  node  is  assigned  a  voltage  and  current;  the  module  has  to  return  the  expected 
relationship  between  the  nodal  voltage  and  current.  For  example,  a  module  consisting  of  two  resistive 
elements  (Figure  2)  will  result  in  a  system  of  equations  of  the  form 


Vl 


Rl3  v3 

A  A  A  /  \ 

'vVvO 


Figure  2.  A  module  consisting  of  two  resistive  elements 


The  system  of  equations  may  be  rewritten  as 


h 

h 

h 

h. 


*,3  0  *13 

1 

0 


*2, 

0 


0 


0 


*24  J 


relating  the  terminal  currents  to  voltages.  The  basic  algebraic  companion  equation  must  fit  into  the 
mask  {/}AT  =  [r]WxAT  {V}^  -  {#}Ar  where  NT  is  the  total  number  of  module  terminals.  In  the  simple 
case  of  the  double  resistor  example 
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transmission  line  matrix  model  is  capable  of  simulating  signal  propagation  down  an  arbitrary  line 
given  the  line  matrices.  These  are  the  [R],  [L],  [G],  and  [C]  matrices:  resistance,  inductance, 
conductance,  and  capacitance  respectively.  These  account  for  the  interaction  between  all  the 
transmission  lines  using  the  following  set  of  equations 


For  our  application,  conductance  may  be  assumed  negligible  compared  to  capacitance.  Applying  a 
basic  finite  difference  scheme  to  the  transmission  line  equations  results  in  the  following  system  of 
equations 


i(n  +  \,k  +  \)~i(n  +  \,k~\) ^  , 
Az  1 


’  LJ  At 

[c]  + 1’  “  V(W’ 


where  [r],  [1],  and  [c]  are  the  per-unit-length  segment  resistance,  inductance  and  capacitance.  The 
transmission  line  equations  may  be  rewritten  such  that  a  leapfrog  finite  difference  approach  may  be 
employed  to  model  signal  propagation  in  the  transmission  line.  The  first  equation  may  be  manipulated 
into  the  current  update  form 


2[/]+A([r] 


2[/]-Ar[r] 


2[/]+A  t[r 


{v(«,£  +  l)-v(«,  A:)} 


Similarly,  the  voltage  update  equation  is  determined  to  be 

v(w  + 1,*)  =  {/(« +j,k  +  \)-i(n  +  \,k- i)}+  v(«,  k) 

The  full  transmission  line  is  discretized  into  small  segments  based  on  the  time  interval.  It  is 
also  possible  to  determine  the  time  interval  based  on  the  segment  length;  however,  since  the  model 
must  fit  into  the  larger  scheme  of  a  circuit  solver,  a  preset  time  step  is  assumed  to  be  inherited  and 
fixed.  In  the  absence  of  feedback  loops,  the  segment  length  must  be  determined  using  the  Courant 
limit  cAt<Az.  Figure  1  illustrates  the  solution  evolution  as  a  function  of  position  and  time.  Currents 
are  determined  over  the  segments  at  half  interval  steps,  while  voltages  are  available  at  the  nodes  at  full 
time  steps.  The  external  nodes  of  the  geometry  form  the  terminal  nodes  where  the  input  ‘driving’ 
voltage  is  impressed.  The  dependence  of  currents  and  voltages  is  determined  by  the  current  and 
voltage  update  equations  in  a  leapfrog  scheme.  Therefore,  each  new  segment  current  is  dependent  on 
the  old  current  at  that  location,  as  well  as  the  old  node  voltages  of  the  segment.  Similarly,  node 
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In  more  complex  systems  involving  either  voltage  or  current  lagging,  the  {B}  vector  relates  to  past 

history  terms. ^  ^  ^  ^  transmission  line  model  into  the  algebraic  companion  form,  a  number  of 
manipulations  must  be  made.  The  network  will  supply  the  termini  node  voltages  and  currents .{V} 
and  {1}  and  expect  the  foil  system  admittance  [Y]  and  history  {B}.  The  transmission  line  will  keep  its 
own  internal  history  of  nodal  voltages  {v},  segment  currents  {i>  and  local  inductances  [1], 

Given  the  new  terminal  voltages,  the  segments’  new  currents  may  be  determined.  In  each 
segment,  this  forcing  current  may  then  be  imposed  on  the  cross-sectional  geometty  via  the  nonlinear 
finite  element  method.  A  foil  magnetic  analysis  of  the  region  (incorporating  the  effects  of  skin  depth 
conductivity,  and  nonlinear  ferromagnetism)  will  yield  the  new  local  inductance  matnx  {1}. 
internal  nodal  voltages  are  then  determined  using  the  voltage  update  equation. 

Sin-,  the  algebraic  companion  forms  of  [Y]  and  {B}  relate  the  time  mstantaneous  termmtd 
current  and  voltage,  the  average  voltage-coincident  terminal  current  must  be  determined  using  central 
differences. 


2[i]+A/[r] 

lr 

2[/]-Ar[r] 

+  [1] 

l 

- 

- 

A t_ 
Az 


2[/]+A([r] 


v(w,fc  +  l)+ 


A t_ 
A z 


2[(]+At[r] 


v(n,k) 


Depending  on  the  terminal  (terminal  1  or  2  corresponding  to  k  =  1  or  k  -  K  -  total  number  of 
segments),  the  equations  may  be  written  as  functions  of  the  terminal  voltages  and  currents  (see  Fig.  1). 


IT]  =  +/(»,  1  +  i)  /„  =  -«(»,  K  +  i) 

Vn  =  +v(w,l)  VT2  =  +v(»,  K  + 1) 


so  that 
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which  may  be  rewritten  in  the  form  of  local  terminal  admittance  and  history  terms 

{ln  }kz  =  \yn  {VTI  -  {&n 


2[/J+A/[r] 


v(n,K) 


where 


2[A]+Ar[r] 

2[/K]+A;[r] 
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The  terminal  parameters  may  then  be  used  to  determine  the  network  interface  through  the  algebraic 
companion  form  of  the  equations 


so  that  the  full  module  admittance  and  history  are 
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Once  [Y]  and  {B}  have  been  determined,  they  are  returned  to  the  network  solver  where  all  other 
module  parameters  are  combined  and  solved  for  the  next  iteration. 


The  Nonlinear  Finite  Element  Solution 

A  surface  admittance  finite  element  formulation  for  the  transient  modeling  of  skin  effect  and 
eddy  current  problems  was  presented  at  ACES ‘97  [4].  This  formulation  used  a  transient  surface 
admittance  boundary  condition  and  employed  the  Prony  expansion  method  to  optimize  the  resulting 
convolution  integral.  This  approach  is  adapted  to  model  ferromagnetic  materials  using  an  iterative 
relaxed  solution  method  [7,8,9,10].  Since  the  surface  admittance  boundary  condition  incorporates 
material  properties,  it  should  be  readily  possible  to  treat  nonlinearity  within  the  surface  time 
integration.  The  iterative  approach  assumes  that  the  magnetic  permeability  at  the  present  time  step  is 
approximately  the  same  as  that  in  the  previous  one.  The  system  of  equations  is  solved  based  on  this 
assumption,  and  new  values  of  the  magnetic  flux  density  are  determined  in  the  nonlinear  surface 
boundary  elements.  These  new  values  of  the  element  magnetic  flux  density  result  in  new  values  of  the 
element  permeability.  The  calculated  permeability  using  an  associated  magnetization  curve  is  then 

Locate  ~  Mi^calc  ) 

A  new  permeability  may  then  be  determined  for  the  next  iteration  using 

/Vk  =  Mold  +  riMcalc  ~  Mold  ) 
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which  depends  on  the  past  iteration  permeability  jUoid,  the  calculated  permeability  /w,  and  the 
relaxation  factor  yfor  0  <  y<  1  .  The  literature  [9,10]  points  to  a  standard  value  of  the  relaxation 
factor  y=  0.1  .  The  relative  change  of  the  permeability  may  be  determined  over  successive  iterations 
as 


is.fi  = 


M* 


Mold 


MoW 


A  convergence  criterion  s  for  the  iteration  process  may  then  be  based  on  the  rms  value  of  the  relative 
changes  taken  over  the  total  number  of  nonlinear  boundary  elements 


s  = 


IM. 

NLE 


A  maximum  allowable  value  of  £^ax  =  0.01  is  found  acceptable  in  the  literature  [9,10].  Should  this 
value  be  larger  than  the  acceptable  pre-set  limit,  the  new  value  of  the  permeability  is  set  and  the 
solution  is  repeated. 

In  order  to  interface  with  the  transmission  line  matrix  formulation,  an  extraction  of  per-unit- 
length  inductance  is  necessary  [13].  The  extraction  of  this  matrix  [1]ncxNC  is  based  on  the  definition  of 
inductance  as  a  function  of  flux  and  magnetic  vector  potential  [11,12] 


u  isz-l,  1 ' 


Mi- 


+t=-"=Ave=° 


where  <pj0  is  the  flux  between  the  j*  line  and  the  reference  conductor,  and  /,  is  the  current  carried  on 
the  iA  line.  The  magnetic  vector  potential  Aj  on  the  surface  of  the  jte  conductor  is  readily  available 
from  the  finite  element  code.  Due  to  the  nonlinear  nature  of  the  problem,  the  converged  magnetic 
properties  of  the  fully  excited  problem  are  determined  and  then  fixed.  Each  conductor  is  then  excited 
separately  in  a  linear  regime  so  that  one  row  of  the  segment  inductance  matrix  may  be  determined. 
This  is  repeated  for  the  number  of  conductors  in  the  problem  NC  until  the  [1]  matrix  is  filled.  The 
remaining  matrices  [r]  and  [c]  are  pre-determined  and  assumed  constant. 


Complete  System  Interface 

The  individual  components  of  the  method  may  be  assembled  and  summarized  as  follows.  At 
each  time  step,  the  circuit  simulator  interfaces  to  the  transmission  line  model  by  sending  the  latest 
terminal  current  and  voltage  updates.  The  transmission  line,  having  the  full  data  set  of  former  currents 
and  voltages,  {i}  and  {v},  determines  the  next  set  of  internal  currents  and  voltages.  Using  the  new 
currents,  it  calls  the  transient  nonlinear  finite  element  method  to  determine  the  segment  inductance 
matrices  [1].  Using  the  terminal  segment  inductance  matrices  and  history,  the  full  admittance  matrix 
[Y]  and  past  history  vector  {B}  are  determined  and  returned  to  the  circuit  simulator.  The  circuit 
simulator  incorporates  [Y]  and  {B}  into  its  own  assembly  for  a  full  system  solution. 
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Conclusion 

Incorporating  a  full  magnetic  simulation  of  transmission  lines  into  a  circuit  simulator  based  on 
algebraic  companion  form  equations  is  presented.  Since  circuit  simulators  and  other  analysis  tools 
require  time  marching  responses  from  their  coupled  modules,  true  transient  analysis  is  applied  in  the 
transmission  line  formulation.  This  validates  the  need  for  time-marching  analysis  and  parameter 
extraction  using  the  surface  admittance  finite  element  approach. 

The  algebraic  companion  form  of  the  transmission  line  and  electromagnetic  model  is 
determined  to  produce  a  terminal  interface  that  fits  the  network  solver.  This  may  be  adapted  to  fit  any 
circuit  simulator  and  may  prove  beneficial  in  incorporating  many  other  kinds  of  analysis  such  as  those 
found  in  microelectronic  packaging  applications. 
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0.  ABSTRACT 

The  residual  vector  R]  =  [Z]A]  -  B]  where  [Z]  is  a  coefficient  matrix,  A]  is  a  vector  of  unknowns 
and  B]  is  a  right-hand  side  vector,  is  often  used  as  a  measure  of  solution  error  when  solving 
linear  systems  of  the  kind  that  arise  in  computational  electromagnetics.  Residual  errors  are 
of  particular  interest  in  using  iterative  solutions  where  they  are  instrumental  in  determining 
the  next  trial  answer  in  a  sequence  of  iterates.  As  demonstrated  here,  when  a  matrix  is  ill- 
conditioned,  the  residual  may  imply  the  solution  is  more  accurate  solution  than  is  actually 
obtained. 


1.  MATRIX  CONDITION  NUMBER  AND  SOLUTION  ACCURACY 

In  previous  related  work  [Miller  (1995)]  a  study  was  described  that  investigated  the  behavior 
of  ill-conditioned  matrices  having  the  goal  of  numerically  characterizing  their  information 
content.  One  numerical  result  from  that  study  was  that  the  solution  accuracy  (SA)  is  related 
to  the  coefficient  accuracy  (CA)  and  condition  number  (CN),  all  expressed  in  digits,  approx¬ 
imately  as  SA  ^  CA  -  CN.  This  conclusion  was  based  on  using,  as  one  measure  of  SA,  a 
comparison  of  [Z][Y]  with  [I],  where  [Z]  is  a  matrix  under  study,  [Y]  is  its  computed  inverse  and 
[i]  is  the  identity  matrix. 

CNs  can  generally  be  expected  to  grow  with  increasing  matrix  size,  even  for  one  as  benign 
as  haying  all  coefficients  being  random  numbers.  For  some  matrices,  the  Hilbert  matrix  for 

example,  one  of  those  studied,  the  CN  can  grow  much  faster,  being  of  order  io1  5N,  for  a 
matrix  of  size  NxN.  A  large  matrix  CN  was  encountered  in  later  work  that  involved  model- 
based  parameter  estimation  (MBPE)  for  adaptive  sampling  and  estimation  of  a  transfer  fun¬ 
ction  [Miller  (1996)]  using  rational  functions  as  fitting  models  (FM).  For  example,  when  us¬ 
ing  simple  LU  decomposition  to  solve  even  a  low-order  system,  say  one  having  fewer  than 
20  coefficients,  the  CN  might  exceed  106.  (Note  that  this  problem  can  be  circumvented  by 
using  a  more  robust  solution,  such  as  singular-value  decomposition,  but  that’s  also  left  for  a 
later  discussion.)  An  interesting  aspect  of  these  large  CNs  was  that  the  match  of  the  FM 
with  the  original  data  when  computed  using  coefficients  obtained  from  [Y]xB],  with  B]  the 
right-hand-side  vector,  could  be  much  less  accurate  than  when  using  coefficients  instead 
obtained  from  back  substitution. 
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2.  SOME  NUMERICAL  RESULTS 

A  typical  result  that  demonstrates  this  behavior  is  shown  in  Fig.  1.  The  specific  situation  il¬ 
lustrated  is  the  match  between  the  original  data  and  the  FM  (using  a  numerator  polynomial 
of  order  n  =  7  and  denominator  polynomial  order  of  d  =  6)  as  the  sample  spacing  is  varied. 

A  fit  of  10  digits  is  equivalent  to  a  residual  error  of  10‘10.  A  large  difference  can  be  seen 
between  the  fit  obtained  using  coefficients  from  an  inverse  operation  compared  with  using 
those  obtained  from  back  substitution.  Note  that  the  poles  in  the  spectrum  used  for  this  ex¬ 
periment  are  spaced  one  unit  apart.  The  improvement  in  the  inverse  result  as  the  data 
spacing  increases  towards  a  Nyquist-like  interval  of  0,5  occurs  because  the  CN  of  the  data 
matrix  decreases. 

Some  additional  computer  experiments  were  conducted  to  explore  this  behavior,  with  the 
results  of  one  shown  in  Fig.  2,  where  several  accuracy  (or,  conversely,  error  measures)  are 
shown  as  a  function  of  matrix  order  for  a  Hilbert  matrix.  The  quantities  plotted  in  Fig.  2  are: 


A]ex  -  A]bs  and  A]ex  -  A]jnv, 

[Z]Ajbs  -  B]  and  [Z]A]jnv  -  B], 

[Y] ex  -  [Y]comp. 

[Z] [Y]comp  -  [I] 

where  “bs”  and  “inv”  refer  to  a  solution  vector  obtained  using  back  substitution  or  inversion, 
and  “ex”  and  “comp”  refer  to  an  exact  analytical  or  computed  inverse  matrix,  respectively. 
Results  shown  were  developed  using  a  single  right-hand  side  having  all  unit  entries.  The 
various  accuracy  results  are  derived  by  computing  an  RMS  difference  between  their  re¬ 
spective  vectors  or  matrices. 

Although  a  different  problem  from  that  illustrated  in  Fig.  1,  the  residuals  are  qualitatively 
similar  in  exhibiting  a  back-substitution  accuracy  that  is  consistently  higher  than  that  from 
the  inverse  solution.  Interestingly,  of  the  six  results  displayed  all  are  in  substantial  agree¬ 
ment  except  for  the  back-substitution  residual.  At  about  N  =  18  and  beyond,  all  reach  a 
noise  floor.  For  the  former  five,  this  implies,  considering  a  compute  precision  of  24  is  being 
used,  a  CN  a  24,  which  is  not  inconsistent  with  io15x18  =  1027.  However,  the  noise  floor 
for  the  back-substitution  residual  remains  at  about  13  digits,  the  explanation  for  which  is  not 
obvious.  Perhaps  most  interesting  is  that  A]bs  and  A]jnv  exhibit  comparable  accuracies  in 
spite  of  the  great  differences  displayed  by  their  residuals.  In  other  words,  Ajbs  is  not  as  ac¬ 
curate  as  might  be  inferred  from  its  residual. 

As  an  explanation  for  the  declining  accuracy  exhibited  in  Fig.  2,  the  result  of  averaging 
some  of  the  more-often  used  CNs  is  plotted  in  Fig.  3,  where  it  can  be  seen  that  a  ceiling  of 
about  the  compute  precision  is  reached.  A  different  way  to  look  at  the  CN  is  to  plot  the  sin¬ 
gular-value  spectrum  of  a  matrix  as  is  done  for  the  Hilbert  matrix  in  Fig.  4.  In  this  case,  N  = 
30  and  compute  precisions  ranging  from  8,  16,  24,  32  and  40  are  used.  In  each  case,  the 
dynamic  range  of  the  spectrum  approximately  equals  the  compute  precision,  and  provides 
another  measure  of  the  condition  number. 
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3.  CONCLUDING  COMMENTS 

To  return  to  the  original  problem  that  motivated  this  discussion,  it’s  not  clear  why  there  is 
such  a  difference  between  the  different  residuals  shown  in  Figs.  1  and  2.  My  particular  rea¬ 
son  for  examining  these  results  is  the  implication  they  may  have  when  using  residuals  in 
determining  the  convergence  of  an  iterative  solution.  It  seems  reasonable,  if  the  residual 
error  is  smaller  than  the  actual  error  in  an  iterated  sequence  of  solution  estimates,  to  con¬ 
clude  that  relying  on  the  residuals  as  an  indicator  of  solution  accuracy  could  be  misleading. 
Of  course,  it  must  be  noted  that  the  difference  between  the  two  error  measures  appears  to 
be  dependent  on  the  CN  of  the  matrix  being  solved.  This  could  be  one  more  reason  why, 
as  problems  are  being  modeled  using  more  and  more  unknowns,  the  potential  related  in¬ 
crease  in  CN  needs  to  be  considered  in  developing  solution  strategies.  Also,  possibly  a  dif¬ 
ferent  measure  of  residual  error  would  circumvent  or  reduce  the  effect  discussed  here. 
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Figure  1.  Fit,  in  digits,  between  the  data  samples  used  for  computing  the  coefficients  of  a  rational-func¬ 
tion  model  and  the  model  results  as  a  function  of  normalized  sample  spacing,  with  computations  done  in 
24-digit  compute  precision.  Results  from  model  whose  coefficients  are  obtained  by  back  substitution  are 
shown  by  the  open  circles  and  those  solved  by  multiplying  the  right-hand-side  vector  using  an  inverse 
matrix  are  shown  by  the  solid  circles. 
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Figure  2.  Various  accuracy  measures  for  solution  of  an  NxN  Hilbert  matrix.  The  circles  show  results 
for  the  residuals  and  the  squares  display  the  solutions,  both  obtained  for  a  right-hand  side  vector  having 
all  unit  values  (in  both  cases  the  open  symbols  represent  inverse  results  and  sold  symbols  the  back-sub¬ 
stitution  results).  The  open  triangles  exhibit  the  result  of  comparing  the  computed  and  exact  inverse  ma¬ 
trices  while  the  solid  triangles  compare  the  product  of  the  original  and  inverse  matrices  with  the  identity 

matrix. 


Figure  3.  The  average  condition  number  of  a  Hilbert  matrix  exhibits  an  approximate  lO^-^N  behavior  as 
expected.  It  maximizes  at  a  value  of  about  the  compute  precision,  which  is  24  digits  for  this  computa¬ 
tion. 
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Figure  4.  The  normalized,  singular-value  spectrum  for  a  Hilbert  matrix  of  N  =  30  with  the  compute  pre¬ 
cision  a  parameter.  These  results  demonstrate  the  large  condition  number  of  a  Hilbert  matrix  while  also 
illustrating  effect  of  compute  precision  on  computations  for  such  a  problem. 
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Abstract.  A  new  benchmark  is  introduced  for  numerical  solutions  of  transient  or  wideband 
electromagnetic  scattering  problems,  and  is  applied  to  a  time-domain  integral  equation  (TDIE) 
solution  scheme.  The  benchmark  is  based  on  a  computation  of  the  direct  backscatter  of  a  spherical 
cap  reflector.  This  structure  possesses  a  surface,  an  edge  and  a  cavity,  and  fully  tests  the  ability 
of  the  TDIE  to  model  the  mechanisms  involved  in  scattering  from  an  arbitrary  metallic  obstacle. 
A  semi-analytic  solution  is  obtained  for  the  direct  backscatter  of  a  spherical  cap  reflector  using 
an  approach  based  on  a  method  of  regularisation.  A  comparison  is  made  of  the  numerical  TDIE 
solutions  with  those  accurately  obtained  by  the  semi-analytic  regularisation  method;  the  TDIE 
approach  is  shown  to  model  accurately  all  of  the  scattering  mechanisms  observed. 

1.  Introduction 

One  of  the  most  fundamental  problems  in  computational  electromagnetics  is  that  of  transient  or 
wideband  scattering  by  an  arbitrary  metallic  obstacle.  A  time-domain  integral  equation  (TDIE) 
approach  to  this  problem  has  many  significant  advantages.  Arbitrary  scatterers  are  easily  mod¬ 
elled,  since  a  triangular  mesh  may  be  used  to  discretise  the  obstacle  surface.  The  Sommerfeld 
radiation  condition  is  automatically  enforced  avoiding  the  need  for  artificially  imposed  boundary 
conditions,  such  as  are  required  with  finite  difference  schemes.  The  calculation  is  performed  on 
a  two-dimensional  grid  of  surface  quantities  rather  than  a  three-dimensional  grid  of  fields;  this 
allows  relatively  large  obstacles  to  be  modelled.  Wideband  scattering  results  can  be  obtained 
in  the  frequency  domain  without  the  need  for  repeated  matrix  inversion,  which  arises  with  a 
frequency-domain  integral  equation  approach. 

Another  significant  advantage  of  the  TDIE  approach  is  that  it  provides  direct  physical  insight 
into  the  mechanisms  of  transient  scattering.  Different  scattering  mechanisms,  such  as  specular 
and  creeping  wave  responses,  are  separated  in  time  and  can  be  distinguished  from  each  other, 
allowing  their  relative  importance  to  be  determined.  Furthermore,  since  the  TDIE  approach  is 
formulated  in  terms  of  the  surface  current  and  charge  densities  on  the  scatterer,  physical  insight 
can  be  gained  into  the  response  of  the  obstacle  itself. 

There  is  widespread  interest  in  establishing  the  validity  and  accuracy  of  general  purpose  numerical 
electromagnetics  codes.  Canonical  diffraction  problems,  such  as  described  in  [1],  provide  a  source 
of  accurately  computable  analytical  solutions  which  therefore  can  be  viewed  as  benchmarks  for 
comparison.  Unfortunately,  the  class  of  analytically  soluble  scattering  problems  is  small;  it  is 
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difficult  to  find  suitable  problems  to  test  numerical  predictions  of  scattering  from  objects  even  of 
modest  complexity,  comprising  say,  edges,  cavities  (apertures),  and  dielectric  or  metal  inclusions. 

In  this  paper  we  discuss  the  canonical  problem  of  transient  scattering  from  a  spherical  cap  reflector. 
This  obstacle  is  of  particular  interest  since  it  comprises  a  surface,  an  edge  and  a  cavity,  and  thus 
tests  the  ability  of  any  general  purpose  (time  domain)  numerical  code  to  model  all  of  the  significant 
scattering  mechanisms  which  might  be  encountered  in  an  arbitrary  metallic  obstacle.  The  other 
advantage  of  the  spherical  cap  reflector  is  that  a  semi-analytical  solution  of  guaranteed  accuracy 
may  be  obtained  via  a  frequency-domain  method  of  regularisation.  The  accuracy  of  this  solution 
can  rigorously  estimated  in  terms  of  a  truncation  number  N,  and  can  be  made  arbitrarily  accurate 
as  N  oo,  thus  fulfilling  the  requirements  of  a  benchmark. 

In  the  next  section  of  this  paper  we  describe  the  scattering  problem  and  how  the  semi-analytic 
solution  is  obtained  via  the  method  of  regularisation  (MoR).  Section  3  describes  the  TDIE  ap¬ 
proach  to  the  same  problem,  and  the  results  obtained  are  then  compared  with  the  results  of  the 
MoR  approach  in  section  4. 


2.  The  method  of  regularisation 


Our  benchmark  scattering  problem  is  depicted  in  Figure  1  and  may  be  described  as  follows.  A 
transient  or  wideband  electric  field  is  incident  upon  the  concave  surface  of  a  spherical  cap  reflector, 
along  its  axis  of  rotational  symmetry.  The  direct  backscattered  electric  field  is  determined,  from 
which  the  wideband  radar  cross  section  of  the  spherical  cap  reflector  may  be  obtained.  It  is  this 
wideband  spectrum  which  will  form  the  basis  of  our  comparisons. 
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Figure  1:  The  dielectric  sphere  with  spherical  cap  reflector. 

In  order  to  obtain  a  semi-analytic  solution  we  formulate  a  similar,  but  more  general,  frequency- 
domain  problem:  determining  the  direct  backscatter  of  a  homogeneous  dielectric  sphere  with 
spherical  cap  reflector.  The  sphere  will  be  supposed  to  be  of  radius  a,  have  dielectric  constant  €i, 
and  be  capped  by  a  perfectly  conducting  (metallic)  layer,  as  depicted  in  Figure  1.  The  composite 
structure  is  assumed  to  be  surrounded  by  a  dielectric  of  permittivity  eo,  and  is  illuminated  by 
a  time-harmonic  incident  field  E '*  of  wavenumber  k$.  Then,  if  the  direct  backscatter  of  the  lens 
reflector  Es  can  be  determined,  its  radar  cross  section  (as  a  function  of  k0a)  may  be  obtained. 
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Let  us  briefly  describe  the  features  of  the  method  providing  our  benchmark  solution.  It  is  discussed 
more  fully  in  [2]  to  which  the  reader  is  referred  for  further  details;  this  reference  considers  the 
more  general  structure  of  a  multi-layer  dielectric  sphere  with  a  metallic  cap,  of  which  the  structure 
depicted  in  Figure  1  is  a  particular  case. 

In  order  to  calculate  the  time-harmonic,  backscattered  far-field  Es  of  the  lens  reflector  the  ap¬ 
proach  of  [2]  uses  Debye  potentials  to  describe  the  electric  and  magnetic  fields.  Then,  due  to  the 
symmetry  of  the  problem,  the  scattered  fields  may  be  represented  by  series  expansions  of  spherical 
wave  functions,  with  electric  and  magnetic  Fourier  coefficients  to  be  determined. 

The  key  to  the  solution  method  relies  upon  ideas  contained  in  [3,  4,  5].  This  requires  a  transfor¬ 
mation  of  (i)  the  boundary  conditions  for  the  tangential  electric  field  components  on  the  metallic 
cap,  and  of  (ii)  the  continuity  conditions  for  the  tangential  magnetic  field  components  on  the 
non-metallic  spherical  surface,  to  the  corresponding  boundary  conditions  for  the  Debye  potentials 
and  their  derivatives.  The  boundary  conditions  for  the  Debye  potentials  are  not  separated  in  the 
case  of  an  unclosed  (open)  spherical  shell,  and  this  leads  to  coupling  of  the  scattered  electric  and 
magnetic  waves.  This  is  in  contradistinction  to  the  decoupled  system  that  arises  in  the  analysis  of 
the  perfectly  conducting  closed  sphere,  leading  to  the  well  known  Mie  series  solution. 

We  thus  obtain  two  pairs  of  coupled  dual  series  equations  for  the  Fourier  coefficients  of  the 
unknown  electric  and  magnetic  fields.  The  dual  series  kernels  are  given  by  the  associated  Legendre 
functions  P„(cos  6).  The  four  coupled  systems  of  equations  depend  upon  parameters  determined 
by  the  boundary  conditions  on  the  surface  of  the  lens.. 

However,  the  resultant  system  of  equations  is  Fredholm  type  of  the  first  kind.  In  order  to  provide 
a  reliable  benchmark,  a  transformation  of  this  system  is  needed  to  obtain  a  set  of  equations  soluble 
by  a  numerically  stable  and  rapidly  converging  algorithm.  The  regularisation  method  described 
in  [3,  4,  5]  transforms  this  coupled  system  of  first  kind  equations  into  four  coupled  systems  of 
equations,  which  are  Fredholm  type  of  the  second  kind.  These  possess  significant  advantages  for 
numerical  stability  and  convergence  over  their  counterparts  of  the  first  kind. 

Whilst  the  resultant  system  has  infinitely  many  linear  equations,  estimates  of  the  rate  of  conver¬ 
gence  permit  the  use  of  truncation  to  a  finite  dimensional  system  from  which  numerical  results 
with  guaranteed  accuracy  may  be  obtained.  The  algorithm  thus  employed  is  free  from  any  limita¬ 
tions  on  parameters  such  as  the  permittivities  eoAi  of  the  dielectric  regions,  the  angle  of  opening 
Oq  of  the  spherical  cap,  or  the  relative  wavenumber  k^a.  In  particular,  eo  and  ei  may  be  assumed 
equal  to  the  permittivity  of  free  space,  yielding  the  radar  cross  section  of  a  spherical  cap  reflector. 
Further,  by  varying  the  relative  wavenumber  k^a  of  the  incident  field  a  wideband  radar  cross 
section  may  be  determined  for  the  spherical  cap  reflector.  It  is  this  spectrum  which  we  wish  to 
compare  with  the  results  of  the  TDIE  approach  described  in  the  next  section. 


3.  Numerical  solution  of  the  scattering  problem 

The  TDIE  approach  which  we  follow  is  based  on  a  time-domain  solution  of  the  electric  field 
integral  equation  (EFIE).  Denote  by  S  the  surface  of  the  spherical  cap  reflector  which  is  assumed 
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to  be  surrounded  by  free  space  with  permittivity  e  and  permeability  denote  by  c  the  speed 
of  light.  The  cap  is  illuminated  by  a  transient  electric  field  E!(r,t)  which  is  incident  upon  the 
spherical  cap  along  its  axis  of  rotational  symmetry.  This  field  induces  a  surface  current  density 
J(r,  t)  and  surface  charge  density  p(r,  t)  on  S,  which  in  turn  give  rise  to  a  scattered  electric  field 
E^(r,t).  If  the  transient  backscattered  electric  field  can  be  determined  then  a  wideband  radar 
cross  section  can  be  obtained  for  the  spherical  cap  reflector. 


Define  a  vector  potential  in  the  usual  manner 

A(r,t)  =  f  / 

— v  '  Air  Js  Ir-n'l 


(1) 


and  a  scalar  potential  <f>(r,t)  similarly  [6].  The  field  radiated  by  the  scatterer,  Es(r,t),  can  be 
represented  in  terms  of  these  potentials.  Further,  since  the  total  electric  field  tangential  to  a 
perfectly  conducting  surface  is  zero,  ( E 7*  +  Es)tan  =  0,  we  may  derive  the  EFIE 


E(r,t)tan  =  W(r,i)  +  -=(r,t) 


(2) 


In  addition,  we  may  deduce  a  continuity  equation  which  will  hold  on  the  scatterer  surface 

^fct)  +  VsJfci)  =  0,  (3) 

where  V5  is  the  surface  divergence  on  S.  Equations  (2)  and  (3)  are  sufficient  to  determine  a 
solution  for  J  and  p,  given  suitable  initial  and  boundary  conditions. 


In  order  to  solve  (2)  and  (3)  numerically,  using  the  time-domain  method  of  moments  we  first 
discretise  our  problem  using  discrete  time  steps  kAt,  (k  =  0, 1, . . .)  and  setting  p(r,  kAt)  =  pk(r)> 
and  similarly  for  other  functions.  Time  derivatives  are  approximated  by  centred  differences. 


We  next  select  spatial  basis  functions  for  J  and  p,  together  with  a  suitable  testing  procedure. 
The  basis  functions  which  we  choose  are  those  commonly  exploited  for  numerical  solutions  of 
the  EFIE,  in  both  the  time  and  frequency  domains.  They  are  described  in  more  detail  in  [7,  8]. 
Define  a  grid  on  S  with  Nc  triangular  patches  arranged  such  that  the  grid  has  Ne  internal  edges 
separating  adjacent  triangles.  Using  this  grid  we  may  define  a  set  of  spatial  basis  functions  for  the 
surface  current,  /  (r),  1  <  n  <  Ne,  which  are  piecewise  linear  over  the  internal  edges  of  the  grid. 
We  may  also  define  a  set  of  Nc  spatial  basis  functions  for  the  surface  charge,  which  are  piecewise 
constant  on  the  triangles  of  the  grid.  See  [8]  for  further  details.  The  surface  current  may  thus  be 
approximated  by 

lk(r)  »  /fe)  =  x;££,(d.  w 

n=l 

where  I*  are  a  set  of  unknown  expansion  coefficients.  The  surface  charge  may  be  similarly  ap¬ 
proximated  in  terms  of  the  Nc  scalar  basis  functions  defined  on  the  triangular  grid  elements. 


To  discretise  the  EFIE  we  apply  a  form  of  Galerkin’s  method.  Define 


tan 


(5) 
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where  A  may  be  defined  in  terms  of  the  approximate  surface  current  J  via  equation  (1)  and 
where  may  be  defined  in  terms  of  the  approximate  surface  charge  density  in  an  analogous 
manner.  In  order  to  ensure  that  the  residual  error  in  our  approximation  of  the  EFIE  is  orthogonal 
to  our  chosen  basis  functions  we  apply  the  (Galerkin)  testing  procedure 

JsRE  ■  £n{r)  dS  =  0,  1  <  n  <  Ne.  (6) 

A  similar  approach  may  be  used  to  test  the  continuity  equation  with  respect  to  the  Nc  scalar 
basis  functions.  As  a  result  a  total  of  Ne  +  Nc  equations  may  be  obtained  in  terms  of  the  Ne 
unknown  expansion  coefficients  for  the  surface  current  density,  I*,  and  the  Nc  unknown  expansion 
coefficients  for  the  surface  charge  density.  If  all  of  these  expansion  coefficients  are  known  for  the 
first  k  time  steps  then  they  may  be  calculated  for  time  step  k+l  using  a  time-marching  algorithm, 
as  described  in  [8].  Note,  for  example,  that  the  unknown  surface  current  coefficients,  J*+1,  may 
be  determined  directly  from  equation  (5).  In  this  manner,  a  solution  may  be  obtained  for  any 
given  time  interval. 

Since  the  resulting  equations  are  of  the  first  kind  they  are  prone  to  numerical  instability  under 
certain  conditions.  The  stability  of  such  first  kind  numerical  schemes  can  be  greatly  improved, 
however,  by  the  use  of  averaging  techniques  [9].  The  method  employed  in  our  solution  scheme  is 
a  three-step  averaging  scheme  of  the  following  form.  Suppose  that  a  solution  has  been  obtained 
for  the  current  density  expansion  coefficients  at  time  step  k ;  denote  these  coefficients  by  /*. 
Using  a  simple  time-marching  algorithm  calculate  the  coefficients  for  the  next  three  time  steps; 
denote  these  coefficients  by  /*+1,  /*+2  and  /*+3.  The  coefficients  for  time  step  k+l  may  now  be 
determined  by  the  averaging  formula 

j„i+,=(2/‘+/j+i+/;+3)/4- 


4.  Results 

Calculations  were  performed  using  the  TDIE  approach  described  in  the  last  section  for  the  problem 
of  transient  electromagnetic  scattering  of  a  Gaussian  waveform  by  the  spherical  cap  reflector.  The 
choice  of  time  step  and  triangular  grid  imposed  on  the  cap  followed  the  standard  guidelines  of 
[7,  8].  The  incident  plane  wave  pulse  (of  peak  amplitude  1  V/m)  propagated  along  the  axis  of 
rotational  symmetry  of  the  reflector,  as  shown  in  Figure  1,  with  a  polarisation  perpendicular  to 
this  axis.  A  half-angle  of  90  —  30°  was  assumed.  Denote  by  A  the  ratio  of  the  Gaussian  pulsewidth 
(full  width  half  height)  to  the  radius  of  curvature  of  the  cap,  a. 

Four  simulations  were  performed  with  A  =  1, 1/2, 1/3  and  1/4.  In  each  case  the  direct  backscat- 
tered  far-field  was  computed  and  a  wideband  radar  cross  section  determined  via  discrete  Fourier 
transformation  of  the  resulting  time  series. 

The  backscattered  far-fields  of  the  spherical  cap  reflector  subject  to  incident  fields  with  A  =  1  and 
A  =  1/2  are  depicted  in  Figure  2,  with  a  dash  curve  and  a  solid  curve  respectively.  The  time  scale 
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is  in  units  of  light-metres.  The  late-time  response  of  the  far-field,  i.e.  the  response  for  t  >  15,  has 
been  magnified  by  a  factor  of  106  for  clarity.  The  first  positive  peak  in  the  backscattered  far-field 
is  that  due  to  the  specular  response;  however,  as  expected  for  electrically  small  scatterers,  the 
early  time  response  waveform  is  approximately  the  second  derivative  of  the  incident  waveform. 
Thereafter  late-time  ringing,  of  smaller  magnitude,  can  be  observed  due  to  the  flow  of  charges 
back  and  forth  on  the  obstacle  as  a  result  of  the  excitation  caused  by  the  incident  field.  When 
A  =  1/2  the  pulsewidth  is  half  that  of  the  pulse  when  A  =  1.  This  naturally  increases  the  high- 
frequency  content  of  the  incident  field  and  leads  to  an  increase  in  the  specular  response.  Note 
also  that  the  late-time  response  is  periodic  but  exponentially  decaying. 


Figure  2:  Backscattered  far-field  for  A  =  1  Figure  3:  Backscattered  far-field  for  A  1/3 

(dash  curve)  and  A  =  1/2  (solid  curve).  (dash  curve)  and  A  =  1/4  (solid  curve). 


The  backscattered  far-fields  of  the  spherical  cap  reflector  subject  to  incident  field  with  A  -  1/3 
and  A  =  1/4  are  depicted  in  Figure  3,  with  a  dash  curve  and  a  solid  curve  respectively.  The  late¬ 
time  response  of  the  far-field,  i.e.  the  response  for  t  >  15,  has  been  magnified  by  a  factor  of  10 
for  the  case  A  =  1/3  and  by  103  for  the  case  A  =  1/4.  It  is  evident  again  that  as  the  pulsewidth 
shortens,  and  the  high-frequency  content  of  the  pulse  increases,  the  specular  response  increases. 
Moreover,  as  expected  for  electrically  large  apertures,  the  early  time  response  is  now  approximately 
proportional  to  the  first  derivative  of  the  incident  waveform;  also  the  late-time  ringing  of  the 
cap  decreases  in  magnitude.  It  may  also  be  seen  that  in  the  case  A  =  1/4  the  creeping  w&ve 
contribution  has  separated  itself  from  the  late-time  ringing  and  is  quite  distinctly  visible.  The 
other  effect  which  is  evident  in  the  late-time  behaviour  of  the  far-field  when  A  -  1/4  is  the 
appearance  of  an  exponentially-growing  oscillatory-type  instability,  as  described  in  [9].  In  order 
to  suppress  this  kind  of  instability,  recourse  must  be  made  either  to  a  high  order  averaging  scheme, 
or  to  a  second  kind  numerical  scheme  involving  some  regularisation  of  the  TDIE  formulation. 

In  order  to  obtain  wideband  radar  cross  sections  for  the  spherical  cap  reflector  from  these  time- 
series,  a  discrete  Fourier  transform  was  applied  to  both  the  incident  Gaussian  pulse  and  the 
computed  far-field  response,  from  which  the  ratio  of  the  resulting  spectra  was  obtained.  The 
radar  cross  sections  are  depicted  in  Figures  4  and  5  together  with  the  results  of  the  semi-analytic 
MoR  approach,  which  is  depicted  by  the  dot-dash  curve. 
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With  a  relatively  wide  pulse,  A  =  1,  the  radar  cross  section  agrees  with  the  benchmark  solution 
only  for  small  relative  wavenumbers,  k0a  <  10.  By  halving  this  pulsewidth,  A  =  1/2,  or  alter¬ 
natively  by  doubling  the  bandwidth  of  the  pulse,  agreement  can  be  extended  up  to  k0a  &  20. 
Similarly  with  A  =  1/3,  reasonable  agreement  can  be  achieved  for  k0a  <  30.  In  the  case  when 
A  =  1/4,  however,  the  radar  cross  section  contains  excessive  high  frequency  noise,  due  to  the  late¬ 
time  instability.  In  order  to  partially  suppress  this  defect  the  far-field  time-series  was  truncated. 
This  eliminated  much  of  the  spurious  high-frequency  energy  in  the  spectrum.  The  resulting  radar 
cross  section  is  depicted  in  Figure  5.  Agreement  can  be  seen  for  the  expected  range  of  relative 
wavenumbers,  k0a  <  40  but  excessive  high-frequency  energy  is  still  apparent  in  the  spectrum.  It 
is  also  evident,  from  Figures  4  and  5,  that  the  accuracy  of  the  numerically  computed  radar  cross 
section  increases  as  the  pulsewidth  of  the  incident  field  is  decreased,  at  any  particular  relative 
wavenumber  koa. 


Figure  4:  Radar  cross  section  for  A  =  1 
(dash  curve),  A  =  1/2  (solid  curve)  and 
method  of  regularisation  (dot-dash  curve). 


Figure  5:  Radar  cross  section  for  A  =  1/3 
(dash  curve),  A  =  1/4  (solid  curve)  and 
method  of  regularisation  (dot-dash  curve). 


It  is  clear  from  these  results  that  the  TDIE  scheme  is  modelling  accurately  all  of  the  various 
scattering  mechanisms  which  contribute  to  the  response  of  the  spherical  cap  reflector.  This  is 
evident  from  the  convergence  of  the  numerically  computed  radar  cross  section  to  that  which  has 
been  calculated  using  the  semi-analytic  MoR  approach. 

5.  Conclusions 


The  spherical  cap  reflector  is  an  interesting  canonical  scatterer  to  study  since  it  comprises  a 
surface,  an  edge  and  a  cavity.  Such  a  structure  tests  the  ability  of  general  purpose  numerical 
codes,  including  those  based  on  the  TDIE,  to  model  all  of  the  significant  scattering  mechanisms 
which  might  be  encountered  in  an  arbitrary  metallic  obstacle.  This  scattering  problem  is  also 
particularly  suitable  as  a  benchmark,  since  a  semi-analytical  solution  may  be  obtained  via  a 
frequency-domain  method  of  regularisation.  This  method  yields  a  second  kind  system  of  equations 
which  can  be  rapidly  solved  to  any  desired  degree  of  accuracy. 
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In  this  paper  we  have  shown  that  TDIE  solutions  to  transient  scattering  problems,  obtained 
following  standard  discretisation  guidelines,  provide  accurate  estimates  for  the  radar  cross  section 
resulting  from  the  principal  scattering  mechanisms  encountered  with  a  spherical  cap  reflector. 
This  suggests  that  the  TDIE  is  well  suited  to  the  task  of  determining  the  transient  or  wideband 
scattering  of  an  arbitrary  metallic  obstacle. 

Of  course,  it  is  desirable  to  validate  general  purpose  numerical  codes  against  a  number  of  bench¬ 
marks,  each  relating  to  particular  physical  mechanisms  or  the  calculation  of  observable  physical 
quantities.  Although  this  paper  focussed  on  backscatter  calculations,  the  spherical  cap  reflector 
also  provides  benchmark  solutions  for  near-field  as  well  as  far-field  quantities,  under  any  angle  of 
illumination.  In  another  direction,  we  draw  attention  to  the  validation  described  in  [10]  of  wide¬ 
band  antenna  impedance  calculated  by  the  TDIE  code  against  benchmarks  provided  by  known 
impedances  of  wideband  (TEM)  horns. 
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1.  Summary 

We  utilize  the  fuzzy  logic  concepts  of  interval  of  confidence  at  a  given  level  of  perception  or  fi  — 
confidence  level  to  study  the  propagation  of  intervals  of  confidence,  regarded  as  specified  error  intervals, 
for  the  input  data  of  a  computer  code  to  the  error  intervals  of  the  output  parameters. 

We  contrast  the  fi  —  confidence  level  with  the  conventional  probability-confidence  in  an  error  interval 
for  a  variable.  That  probability-confidence,  or  p-confidence,  is  derived  from  the  probability  distribution  for 
that  variable.  We  show  that  a  fuzzy-logic  pt  —  confidence  curve  for  a  variable,  when  interpreted  as  a  p- 
confidence  curve,  yields  no  information  about  the  probability  curve  for  that  variable.  Therefore  the  ordinary 
operations  of  addition,  subtraction  and  multiplication  of  two  fuzzy  variables  at  a  given  level  of  presumption 
-interpreted  at  a  p-confidence  level-cannot  yield  the  probability  distribution  for  the  resultant  variable  and 
hence  its  p-confidence  at  the  same  level  of  presumption. 

We  illustrate  the  propagation  of  error  intervals  with  reference  to  the  critical  subroutines  FACTR  and 
SOLVE  in  the  Numerical  Electromagnetics  NEC2  computer  code.  These  solve  the  matrix  equation  AX  =  B 
for  X.  The  fuzzy  versions,  FACTRFUZ  and  SOLVEFUZ  track  error  intervals  for  each  parameter  calculated, 
based  on  the  fuzzy  arithmetic  rules  for  sum,  difference,  and  product  of  two  parameters  and  inversion  of  a 
single  parameter.  The  error  intervals  for  the  Ay  and  Bj  must  be  specified,  and  the  fuzzy  codes  compute  the 
error  intervals  for  the  Xj . 

Insertion  of  coding  for  error  intervals  throughout  a  large  code  is  impractical,  so  we  show  how  we  might 
propagate  small  error  intervals  at  the  input  directly  through  an  error-insensitive  portion  of  the  code  to  the 
error-sensitive  subroutine(s),  then  propagate  the  intervals  through  these  subroutines  line  by  line,  and  finally 
propagate  the  output  error  intervals  directly  through  the  remaining  error-insensitive  portion  of  the  code  to 
the  code  output. 


2.  Fuzzy  Logic  Concepts 

To  discuss  the  concepts  of  interval  of  confidence,  which  we  regard  as  error  interval,  and  level  of  perception 
or  ft  —  confidence  ( not  a  conventional  confidence )  level  consider  Fig.  1,  reproduced  from  Fig.  1.8  of  [1] 
with  some  horizontal  and  vertical  lines  added.  Parameter  p,0  <  fi  <  1,  on  the  vertical  axis  measures  a  level 
of  perception  or  p  —  confidence  level  for  the  fuzzy  intervals  for  x,  y,  and  z  =  x  +  y  defined  by  the  curves  A, 
B,  and  C,  respectively.  At  the  /i-level  of  perception  a  the  dashed  horizontal  line  defines  the  fuzzy  interval 
a2^]  for  x  and  corresponding  intervals  for  y  defined  by  curve  B  and  z  defined  by  curve  C. 

These  intervals  are  interpreted  as  follows.  At  the  a-level  of  presumption  we  presume  that  all  x-values 
lie  within  [a^®),  a2(“)j,  and  all y-values  lie  in  the  range  [6^“),  ftgC®)].  Therefore,  all  z  —  x  +  y  values  lie  in 
the  range  [ci^,  c2^“^  =  [ai(o)+  b^a\  a2(*)+  62(or)]. 

The  only  fuzzy-logic  concepts  we  require  for  our  error  analysis  are  the  rules  for  adding,  subtracting, 
and  multiplying  these  error  intervals,  and  determining  the  error  interval  for  an  inverted  parameter  from  the 
interval  for  the  parameter  itself.  We  require  no  other  information  from  the  parameter  curves  such  as  A,  B, 
and  C  of  Fig.  1. 

A  parameter  curve  such  as  A  tells  us  nothing  about  the  probability  distribution  for  its  parameter  x  and 
nothing  about  the  p-confidence  that  x  will  lie  within  an  interval  such  as  a2(®)].  In  order  to  gain  a 

perspective  about  fuzzy  logic  in  relation  to  conventional  probability-confidence  analysis  we  explain  this  in 
the  next  section. 
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3.  Relationships  Between  Fussy  Logic  Parameters  and  Probability  and  Probability-Confidence 
Distributions 

Curves  A  B  and  C  in  Fig.  1  all  have  the  property  that  the  fuzzy  interval  at  the  fi  -  confidence  level 
decreases  as  p’ increases.  This  suggests  interpreting  that  fuzzy  interval  at  a  given  value  n  -  a  as  a  conven- 
tnalTccnfidence  interval,  mearing  the  area  under  the  probability  curve  for  vanabfe  M  or  z  along  xts 
fuzzy  interval  is  that  confidence.  If  we  do  this  we  relate  the  p-confidence  Cp  tofia s  C  -  1  whenp 
is  zero  Cv  is  1  and  the  variable  has  its  maximum  range  of  ambiguity.  When  \i  -  1,  Cp 
confidence  that  the  variable  will  have  exactly  its  value  at  the  pea k  of  its  curve. 

The  mathematical  statement  of  this  is,  for  curve  A  in  Fig.  I, 
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Howemr,  wc  cannot  deduce  P(x)  from  this  relation!  The  reaaon  is  as  foUows:  if  w.  consider  a  smal  mcrease 
iC  downward  on  curve  A  at  level «,  say,  and  the  corresponding  changes  dr,  (negative)  at  a,  and  dr,  >  0 

at  „<•>  we  only  know  that  dCr  =  “ables  “  t0  S'”"*1'  umq“'ly  n'“h'r 

P(x)  on  the  positive-slope  side  of  the  curve  nor  P{x)  on  the  negative-slope  side. 

Without  these  probability  curves  for  x  and  y  we  could  not  generate  the  p-confidence  over  the  associated 
interval  [d<“>,  c2(“>]  =  [«/“>+  a2<“)+  h(o)]  for  z  which  only  coincidentally  would  equal  1  -  a.  The 

conventional  relation  for  P(z)  is 

P(z)  =•  J  P(x)P(y  =  z-  x)dx  for  all  a<*>  <x<  a\ <">  and  <  y  =  *  -  x  <  ,  (2) 

and  without  P(z)  we  could  not  compute  the  p-confidence  Cp  for  z  at  this  a  level  for  x  and  y  as 
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Here  the  limits  on  z  are  determined  from  (2). 

Fuzzy  logic  does  not  involve  p-confidence  levels  defined  this  way.  It  only  says  that,  at  a  level  of  presump¬ 
tion  a,  x  is  ambiguous  only  over  the  inverval  [a[a),a\°>]  and  y  is  ambiguous  only  over  the  interval^  ,  62  j, 
so  an  operation  A(0)B,  where  O  represents  addition,  subtraction,  multiplication,  or  inversion,  to  produce 
C  results  in  a  fuzzy  interval  [c<o),<4a))  determined  only  by  the  extremes  of  the  intervals  for  x  and  y 

These  are  the  only  properties  of  fuzzy  logic  we  employ  for  the  analysis  of  propagation  of  errors  through 
computer  codes. 


4.  Propagation  of  Errors  Through  Subroutines  FACTR  and  SOLVE,  Which  Solve  the  Matrix 
Equation  AX  =  Y  for  X  in  the  Numerical  Electromagnetics  Code  NEC2 

These  subroutines  are  critical  for  error  propagation  because  they  involve  division  by  a  parameter  which 

might  have  an  error  range  which  includes  zero.  _ ^  ’  . 

We  have  added  coding  for  the  appropriate  error  limits  to  FACTR  and  SOLVE  and  identified  the  revised 
subroutines  by  the  suffix  FUZ.  These  subroutines,  FACTRFUZ  and  SOLVEFUZ,  are  appended.  They  employ 
thes  fuzzy  addition  and  subtraction  rules  [1] 

A(+)B  =  [ai,  a2]  +  [&i,  &2]  =  [oi  +  <»i ,  a2  +  b2]  (4) 

A(-)B  =  [oi  -  b2,  a2  -  6i].  (5) 
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For  multiplication  the  subroutines  call  PRODFUZ,  also  in  Appendix  A,  which  computes  the  fuzzy  interval 
or  error  interval  for  A  (x)  B)  according  to  one  of  the  twelve  cases  summarized  in  Appendix  A  of  [lj. 

Error  intervals  for  the  inverse  l/j4[ai,a2]  =  [I/02, 1/a  1]  if  0  <  01  <  a2  or  ai  <  a2  <  0  are  adjusted  if 
one  of  the  bounds  equals  zero  or  the  interval  passes  through  zero.  In  the  former  case  the  bound  at  zero  is 
moved  toward  the  outer  bound  by  0.001  of  that  outerbound.  In  the  latter  case  the  error  interval  for  1/A  is 
set  to  [-1.E6.+1.E6]. 

Comments  are  inserted  throughout  FACTRFUZ,  SOLVEFUZ,  and  PRODFUZ  for  clarification. 

We  have  not  inserted  these  subroutines  into  NEC2  and  specified  error  intervals  in  problems  of  interest 
for  the  elements  of  A  (the  system  matrix)  and  computed  the  error  intervals  in  the  output  matrix  B  of 
SOLVEFUZ  (which  is  X  in  AX  =  B).  These  error  intervals  are  specified  in  the  matrices  AFUZ  and 
DFUZ(l),  DFUZ(2)  in  FACTRFUZ  and  AFUZ,  YFUZ  (1),  YFUZ(2),  BFUZ(l),  BFUZ(2)  in 
SOLVEFUZ.  However,  this  procedure  would  be  straightforward. 


5.  Propagation  of  Small  Error  Intervals  Through  a  Large  Computer  Code. 

It  might  be  impractical  to  insert  fuzzy  arithmetic  stateements  throughout  a  large  code  as  we  did  in 
FACTRFUZ  and  SOLVEFUZ.  But  if  we  could  localize  the  subroutines  critical  for  rerror  propagation  we 
might  proceed  in  an  approximate  way  as  follows. 

Let  the  matrix  Tj  transfer  the  input  column  matrix  W  of  data  to  the  column  matrix  X  which  is  input 
to  these  critical  subroutines.  Assume  these  subroutines  act  in  series  to  transfer  X  to  their  output  column 
matrix  Y.  Finally,  let  matrix  T2  transfer  Y  to  the  code  output  column  matrix  Z. 

These  matrix  operations  are  represented  with  their  row  x  column  dimensions  as 

(Ti)jtf*/v'WtfXl  =  XAfXi;  ApxAfXji/xi  =  Ypxi;  (T2)qx.pY.pxi  =  Zqxi  (6) 

We  assume  that  Tj  and  T2  remain  the  same  over  the  narrow  error  intervals  of  W  and  Y,  respectively. 

Now  consider  the  problem  of  evaluating  the  error  intervals  in  X  from  those  in  W  without  knowing  (or 
wanting  to  know)  the  analytic  form  of  the  (Ti)mns.  We  must  compute  the  error  interval  in  Xm  according 
to 


[X«  -  rm,Xm  +  xm]  =  (TjWfWr  -  wi ,  W,  +  tnj  +  (Ti)m2[W2  -  w2,W2  +  w2] 

N 

+  "b»>W*  +  4  for  l  <m<M  (7) 

n= 3 

We  have  specified  the  Xm,  Wi,  etc.  to  lie  at  the  centers  of  their  error  intervals. 

In  the  process  of  computing  Ti  W  =  X  for  all  the  midrange  values  of  the  Xm  we  must  know  all  values  of 
(Ti)m„Wn  for  each  mn  pair.  The  reason  is  that  we  must  know  what  the  magnitude  and  sign  of  each  (Ti)mn 
will  be.  A  term  in  (7)  For  example,  represented  by  t[ai,o2]  will  be  [iai,  ta2]  if  t  >  0  but  t[ai,a2]  =  [<a2,tai] 
if  t  <  0.  Therefore  the  signs  of  the  (Tj)mn  will  determine  how  the  successive  intervals  are  added  in  (7)  to 
determine  the  final  xm. 

In  this  way  we  evaluate  all  the  error  intervals  [Xm  —  xm,Xm  +  im]  for  1  <  m  <  M. 

We  next  perform  the  next  matrix  operation  in  (6)  to  propagate  the  error  intervals  in  X  to  those  in  Y, 
the  output  matrix  of  the  error-sensitive  subroutine(s) .  We  do  this  by  inserting  line-by-line  fuzzy  arithmetic 
coding  into  A  for  those  subroutine(s),  as  we  did  in  FACTRFUZ  and  SOLVEFUZ. 

Then  we  perform  the  last  matrix  operation  of  (6)  for  the  error  intervals  through  T2  in  the  same  manner 
that  we  propagated  the  error  intervals  in  W  to  those  in  X  in  (7). 

8.Reference 

[1]  A.  Kaufmann  and  M.  M.  Gupta,  Introduction  to  Fuzzy  Arithmetic,  Theory  and  AppKca- 
tions,  (paperback)  Van  Nostrand  Reinhold,  1991. 
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SUBROUTINE  FACTRFUZ  (N,  A,  AFUZ,  IP,NDIM) 

C A- MATRIX  NOT  TRANSPOSED-  Variables  are  Real,  not  Conplex 

CFCJZ  Changes  in  Subroutine  FACTR  of  NEC2  to  allow  for  error  intervals.  Interval 
CFtJZ  ends  are  specified  by  the  parameters  denoted  by  suffix  “FOZ11..  Intervals 
CFUZ  AFUZ (N,N,  1) ,  lower  end,  AFUZ(N,N,2),  upper  end,  must  be  specified  at  input. 
CFUZ  These  are  calculated  at  output  for  SOLVEFUZ,  which  also  operates  on  DFUZ  via 
CFUZ  COMMON/  SCRATM . 

SAVE  ,  , 

DIMENSION  A(51, 51)  , AFUZ  (51, 51, 2)  ,IP(5l)  ,ARJFUZ{2) , 

.PFUZ (2) , QFUZ (2) ,RFUZ(2) 

COMMON/SCRATM/  D(51)  , DFUZ  (51,2) 

INTEGER  R,RM1,RP1,PJ,PR 

IFLG=0 

DO  9  R=1,N 

DO  1  K=1,N 

D(K)=A(K,R) 

DFUZ(K,1)=AFUZ(K,R,1)  ;  DFUZ  (K,  2)=AFUZ  (K,R,  2) 

1  CONTINUE 
RM1=R-1 

IF  (RM1.LT.1)  GO  TO  4 
DO  3  J=1,RM1 
PJ=IP ( J) 

ARJ=D(PJ) 

ARJFUZ  (1)  =DFUZ  (PJ,  1)  ;  ARJFUZ  (2)=DFUZ  (PJ,  2) 

A(J,R)=ARJ 

AFUZ(J,R,1)=ARJFUZ(1)  ;  AFUZ  (J,R, 2)  =ARJFUZ  (2) 

D(PJ)=D(J) 

DFUZ  (PJ,  1)=DFUZ  (J,  1)  ;  DFUZ  (PJ,  2)  =DFDZ  (J,  2) 

JP1=J+1 

QFUZ  (1)  =ARJFUZ  (1)  ;  QFUZ  (2)  =ARJFUZ  (2) 

DO  2  I=JP1,N 
D(I)=D(I)-A(I,J)*ARJ 

PFUZ  (1)  =AFUZ  (I,  J,  1)  ;  PFUZ  (2)=AFUZ  (I,  J,  2) 

CFUZ  PRODFUZ  Corrputes  the  fuzzy  interval  for  PFUZ*QFUZ  =  RFUZ  according  to  the  table 
CFUZ  of  formulas  in  the  text. 

CALL  PRODFUZ  (PFUZ ,  QFUZ ,  RFUZ) 

CFUZ  DEFUZ  next  follows  the  subtraction  formula  for  fuzzy  intervals  according  to  the 
CFUZ  fonrtala  in  the  text 

DFUZ (I,1)=DFUZ (1,1) -RFUZ (2) 

DFUZ  (I,2)=DF0Z  (1, 2)  -RFUZ  (1) 

2  CONTINUE 

3  CONTINUE 

4  CONTINUE 

DMAX=D(R)  **2  IJust  an  indicator  of  pivot  magnitude 

IP  (R)  =R 

RP1=R+1 

IF  (RP1.GT.N)  GO  TO  6 
DO  5  I=RP1,N 

EIHAG=D(I) **2  !Just  an  indicator 

IF  (ELMAG.LT.DMAX)  GO  TO  5 
EMAX=ET*IAG 
IP  (R)  =1 

5  CONTINUE 

6  CONTINUE 

IF  (EMAX.LT.l.E-10)  IFDG=1  !  IFLAG 

PR=IP(R) 

A(R,R)—D(FR) 
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AFUZ(R,R,1)=DFUZ(PR,1)  ;  AFUZ  (R,R,  2)  =DFUZ  (PR,  2) 

D  (PR)  =D(R) 

DFUZ(PR,1)=DFUZ(R,1)  ;  DFUZ  (PR,  2)  =DFUZ  (R,  2) 

IF  (RP1.GT.N)  GO  TO  8 
ARJ=1./A(R,R) 

C . 

C. . .  Involved  coding  for  the  inverse,  to  change  the  error  interval  if  it  includes 
C...  zero. 

X1=AFUZ  (R,  R,  1)  ;  X2=AFUZ  (R,  R,  2 ) 

IF  (XI  .GT.  0.)  GO  TO  40  'Take  +inverse 
IF  (XI  -EQ.  0.  .AND.  X2  .GT.  0.)  GO  TO  42  Ilncrease  XI 

IF  (XI  .IT.  0.  .AND.  X2  .GT.  0.)  GO  TO  44  (Write  and  set  wide  bounds 

IF  (XI  .IT.  0.  -AND.  X2  .EQ.  0.)  GO  TO  46  (Decrease  X2 

IF  (X2  .LT.  0.)  GO  TO  40  !Take  -inverse 

42  WRITE  (6,102)  R,R,  AFUZ  (R,R,  1)  ,  AFUZ  (R,R,  2) 

102  FORMAT  (IX,  'A(  ’ ,  12,  1  ,  '  12,  *  j  ' ,  ’has  fuzz,  bounds  '  ,1PE12.3, 

.  1PE12.3,'--  increase  lower  one  to  0.001  of  upper  one') 

X1=0.001*X2;  GO  TO  40  (Take  +inverse 
44  WRITE  (6,104)  R,R,  AFUZ  (R,R,  1)  ,AFUZ  <R,R,  2) 

104  FORMAT  (IX, ’A( ' ,  12, ' , ' 12 , ' ) ' , 'has  fuzz,  bounds ',  1PE12 . 3 , 

1PE12.3,'”  SET  BOUNDS  -E6,+E6') 

ARJFUZ  (1)=-1.E6;  ARJFUZ (2) =+l.E6;  GO  TO  50 
46  WRITE  (6,106)  R,R, AFUZ  (R,R,  1)  ,AFUZ  (R,R,  2) 

106  FORMAT  (IX, 'A(  ’  ,12,  ' , '12,  •)  ' , 'has  fuzz,  bounds ',  1PE12 . 3 , 

1PE12 . 3 ,  *  -  -  decrease  T^Dper  one  to  0 . 001  of  lower  one ' ) 

X2=0.001*X1 

40  ARJFUZ  (1)=1./X2;  ARJFUZ  (2)  =1 . /XI 

50  CONTINUE 

C . 

DO  7  I— RP1 ,  N 
A(I,R)=D(I)*ARJ 

PFUZ (1)=DFUZ (1,1) ;  PFUZ (2)=DFUZ (I,  2) 

QFUZ  (1)=ARJFUZ (1) ;  QFUZ (2)=ARJFUZ (2) 

CALL  PRODFUZ (PFUZ ,  QFUZ , RFUZ) 

AFUZ  (I , R,  1 )  =RFUZ  (1 )  ,*  AFUZ  (I,R,2)=RFUZ  (2) 

7  CONTINUE 

8  CONTINUE 

IF  (IELG.EQ.0)  GO  TO  9 
WRITE  (6,10)  R,DMAX 
IFLG=0 

9  CONTINUE 
RETURN 

10  FORMAT  (1H  ,6HPIV0T(,I3,2H)=,E10.2) 

END 
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DO  3  1=1,  N 

PI=IP(I) 

y(X)=B(PI) 

YFOZ(I,1)=BFOZ(PI,1) ;  YFOZ(I,2)=BFOZ(PI,2) 

B  ( PI )  =B  ( I ) 

BFTJZ  (PI,  1)=BFUZ  (1, 1)  ;  BFOZ(PI,2)=BFOZ(I,2) 

IP1=I+1 

IF  (IP1.GT.N)  GO  TO  2 
DO  1  J=IP1,N 
B(J)=B(J)-A(J,I)*Y(I) 

PFOZ(l)=AFOZ(J,I,l)  ;  PFUZ(2)=AFOZ(J,I,2) 

QFUZ ( 1 ) =YFUZ (1,1)  ;  QFOZ(2)=YFOZ(I,2) 

CFOZ  PRGDFOZ  Computes  the  fuzzy  interval  for  PFOZ*QFOZ  =  RFOZ  according  to  the  table 

CFUZ  of  fornulas  in  the  text.  _ 

CALL  PRODFUZ  ( PFUZ ,  QFOZ ,  RFUZ)  . 

CFOZ  DEFOZ  next  follows  the  subtraction  formula  for  fuzzy  intervals  according  to  the 

CFOZ  forraila  in  the  test 

BF0Z(J,1)=BFUZ(J,1)  -RFOZ  (2) 

BF0Z(J,2)=BFUZ(J,2)  -RFOZ(l) 

1  CCNTINOE 

2  CONTINUE 

3  CCJTnKOE 
DO  6  K=1,N 
I=N-K+1 
SOM=0. 

SOMFOZ  { 1 )  =0 .  ;  SUMFUZ  (2)=0. 

IP1=I+1 

IF  (IP1.GT.N)  GO  TO  5 
DO  4  J=IP1,N 
S0M=SUM  +  A(I,J)*B(J) 

PF0Z(1)=AF0Z(I,J,1)  ;  PFOZ  (2)=AF0Z  (I,  J,2) 

QFOZ (1) =BFOZ (J, 1) ;  QFOZ (2)=BFOZ (J,2) 

CALL  PRODFOZ  (PFOZ, QFOZ, RFOZ)  . 

CFOZ  DEFOZ  next  follows  the  addition  formula  for  fuzzy  intervals  according  to  the 

CFOZ  formula  in  the  text  _  _ 

SOMFOZ  (1)  =SUMFUZ  (1)  +RPOZ  (1) ;  SUMFOZ  (2)=SOMFOZ  (2)  +RFOZ  (2) 

4  CONTINUE 

5  CONTINUE 

B(I)  =  (Y(I)  -SOM)/A(I,I) 

C. !  *  involved  "coding 'for  "the  "inverse,"  tochange  the  error  interval  if  it  includes 
C...  zero. 

Xl=AFOZ (1,1,1) ;  X2=AFOZ(I,I,2) 

IF  (Xl  .GT.  0.)  GO  TO  40  (Take  +inverse 

IF  (XI  .EQ.  0.  .AND.  X2  .GT.  0.)  GO  TO  42  (Increase  XI 
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IF  {XI  .LT.  0.  .AND.  X2  .GT.  0.)  GO  TO  44  !Write  and  set  wide  bounds 

IF  (XI  .LT.  0.  .AND.  X2  .EQ.  0.)  GO  TO  46  !Decrease  X2 

IF  (X2  .IIT.  0.)  GO  TO  40  ITake  -inverse 

42  WRITE  (6,102)  I,I,AFUZ(I,I,1)  ,AFUZ (1,1,2) 

102  FORMAT  (IX,  'A( '  ,12,  ’ ,  '12,  ' )  ' ,  ’has  fuzz,  bounds 1PE12 . 3 , 

1PE12.3,'--  increase  lower  one  to  0.001  of  upper  one') 

X1=0.001*X2;  GO  TO  40  ITake  + inverse 
44  WRITE  (6,104)  I, I,AFUZ  (1, 1, 1)  ,AFUZ  (I,  I,  2) 

104  FORMAT  (IX,  'A( ' ,  12, ' , '  12, ' } ' ,  'has  fuzz .  bounds ',  1PE12 . 3 , 

1PE12.3,'--  SET  BOUNDS  -E6,+E6') 

QFUZ  (1)=-1.E6;  QFUZ  (2)=+l  .E6;  GO  TO  50 
46  WRITE  (6,106)  I,  I, AFCJZ  (1, 1, 1)  , AFOZ  (1, 1, 2) 

106  FORMAT  (IX,  'A(  •  ,12, ' ,  *12, ' )  ' ,  'has  fuzz,  bounds ',  1PE12 . 3 , 

1PE12.3,  decrease  upper  one  to  0.001  of  lower  one') 

X2— 0 . 001*X1 

40  QFUZ(1)=1./X2;  QFUZ(2)=1./X1 

50  CONTINUE 

C. .  .  NOW  get  the  FUZfid.  bounds  on  (Y(I)  -SUM)  : 

PFUZ (1)=YFUZ (1,1)  -SUMFUZ  (2)  ;  PFUZ  (2)=YFUZ  (1,2)  -SUMFUZ (1) 

C. . .  Next,  the  final  bounds  on  B(I)  : 

CALL  PRODFUZ{ PFUZ, QFUZ, RFUZ) 

BFUZ(I,1)=RFDZ(1) ;  BFUZ (I, 2) =RFUZ (2) 

C . 

6  CONTINUE 
RETURN 
END 
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SUBROUTINE  PRODFUZ (PFUZ , QFUZ ,  RFUZ) 

******  ********* 

CFUZ  This  subroutine  computes  the  error  interval  for  PFUZ*QFTJZ  =  RFUZ, 

CFUZ  according  to  the  twelve  cases  for  the  product  in  Appendix  A  of  [1]  - 
CFUZ  Subscript  1  denotes  the  lower  fuzzy  limit;  subscript  2,  the  upper  one. 

SAVE 

DIMENSION  PFUZ  (2)  , QFUZ  (2)  , RFUZ  (2) 

A1=PFUZ (1)  ;  A2=PFUZ (2) ;  B1=QFUZ (1) ;  B2=QFUZ (2) 

IF  (0.  .LT.  Al)  GO  TO  2  leases  1,2,3 
IF  (A2  .LT.  0.)  GO  TO  4  leases  7,8,9 
C _  cases  4,5,6: 

IF  (0.  .LT.  Bl)  GO  TO  10  lease  4 

IF  (B2  .LT.  0.)  GO  TO  12  lease  6 

C. . .  case  5: 

X1=A1*B2;  X2=A2*B1;  X3=A1*B1;  X4=A2*B2 
IF  (XI  .BE.  X2)  R1=X1 

IF  (XI  .GT.  X2)  R1=X2 

IF  (X3  -LE.  X4)  R2=X4 

IF  (X3  .GT.  X4)  R2=X3  ;  GO  TO  30 
2  IF  (0.  .LT.  Bl)  GO  TO  14  lease  1 

IF  (B2  .LT.  0.)  GO  TO  16  lease  3 
C...  case  2: 

R1=A2*B1;  R2=A2*B2;  GO  TO  30 
4  IF  (0.  .LT.  Bl)  GO  TO  18  lease  7 

IF  (B2  .LT.  0.)  GO  TO  20  I  case  9 
C. . .  case  8: 

R1=A1*B2;  R2=A1*B1;  GO  TO  30 
10  R1=A1*B2;  R2=A2*B2;  GO  TO  30 

12  R1=A2*B1;  R2=A1*B1;  GO  TO  30 

14  R1=A1*B1 ;  R2=A2*B2;  GO  TO  30 

16  R1=A2*B1;  R2=A1*B2;  GO  TO  30 

18  R1=A1*B2;  R2“A2*B1;  GO  TO  30 

20  R1=A2*B2 ;  R2=A1*B1 

30  RFUZ  (1)  =R1;  RFUZ(2)=R2 

RETURN 
END 
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Abstract 

This  paper  presents  a  new  method  of  solving  operator  boundary  value  problems  applied  to  modelling  elec¬ 
tromagnetic  waveguides.  The  algorithm  is  based  on  a  concept  of  implicit  projection  of  operators  onto  finite¬ 
dimensional  domains.  It  is  shown  that  this  approach  allows  a  reduction  of  both  computation  time  and  memory 
requirements  necessary  to  solve  the  problem  at  hand.  Moreover,  the  presented  numerical  tests  indicate  that  the 
discussed  technique  may  be  efficiently  implemented  in  parallel  distributed  memory  systems  providing  means  to 
deal  with  large-scale  computational  problems  of  electromagnetics. 


1  Introduction 

The  necessity  of  solving  large-scale  problems,  arising  in  electromagnetics  creates  a  demand  for  more  efficient 
new  algorithms  being  able  to  deal  with  growing  complexity  of  the  modelled  systems.  One  of  widely  investigated 
approaches  towards  solving  large  and  complex  computational  problems  is  designing  scalable  algorithms  suitable 
for  use  in  parallel  distributed  memory  systems.  In  order  to  use  efficiently  both  multiple  processor  power  and 
distributed  memory  resources  of  these  systems  in  order  to  obtain  speed-up  of  computations  the  created  algorithms 
need  to  assure  workload  balancing  across  the  processors  and  minimize  the  inter-processor  data  movement.  In  the 
case  of  methods  of  solving  boundary  value  problems  for  infinite-dimensional  operators,  being  of  particular  interest 
in  computational  electromagnetics,  two  important  aspects  have  to  be  taken  into  account  as  to  obtain  a  scalable 
algorithm.  These  include:  1)  designing  a  method  of  mapping  of  the  initial  infinite-dimensional  operator  onto  a 
finite-dimensional  space  allowing  easy  and  efficient  parallelization,  2)  finding  a  scalable  algorithm  capable  of  solving 
the  emerging  finite-dimensional  problem  for  an  approximate  discrete  operator  (cf.  [1]). 

For  instance,  if  a  class  of  differential  operators  is  considered,  one  may  easily  find  out  that  the  Finite  Difference 
(FD)  projection  technique  allows  straightforward  and  very  efficient  parallelization  strategy  based  on  static  domain 
decomposition,  as  compared  to  Finite  Element  Method  (FEM)  which  requires  more  sophisticated  schemes  in  order 
to  assure  e.g.  load-balancing  of  parallel  computations. 

Referring  to  algorithms  of  solving  eigenproblems  for  finite-dimensional  linear  operators  one  has  to  mention  the 
Implicitly  Restarted  Arnoldi  Method  (IRAM)  [2],  [3],  which  is  a  modern  algorithm  capable  of  finding  eigenvalues 
of  a  finite-dimensional  non-symmetric  operator.  The  basic  iteration  of  the  method  (i.e.  the  Arnoldi  factorization) 
is  based  on  an  operation  of  creating  an  orthonormal  basis  in  a  Krylov  space  associated  with  a  given  operator. 
This  operation  may  be  easily  parallelized  and  its  implementation  in  distributed  memory  systems  involves  little 
inter-processor  communication  [4].  Consequently,  the  IRAM  is  found  to  be  a  highly  efficient  scalable  algorithm  for 
solving  eigenproblems,  capable  of  finding  several  eigenvalues  from  a  specified  part  of  the  finite-dimensional  operator 
spectrum,  e.g.  eigenvalues  with  the  largest  real  part,  and  allowing  one  to  reduce  significantly  both  computational 
complexity  as  well  as  memory  requirements  of  the  numerical  solvers.  Another  important  feature  of  the  discussed 
algorithm  is  that  during  the  Arnoldi  factorization  the  information  on  the  input  finite-dimensional  matrix  operator 
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T  is  passed  into  the  algorithm  only  via  the  matrix-vector  product  Zv.-  Consequently,  the  finite-dimensional  linear 
operator  does  not  have  to  be  represented  explicitly  by  a  matrix  of  elements.  This  feature  of  IRAM  together  with 
its  high  performance  in  parallel  systems  are  exploited  in  this  paper  to  propose  an  efficient  scalable  algorithm  ot 

solving  operator  eigenproblems  arising  in  electromagnetics.  „ 

The  problem  considered  within  this  study  is  finding 
modes  in  a  dielectric  waveguide,  shown  in  Figure  1, 
which  is  homogeneous  in  the  z  direction  and  has  an 
arbitrary  permittivity  profile  e(x,  y )  in  its  cross-section 
{x  -y  plane).  This  structure  may  be  modelled  by  the 
following  differential  equation  (cf.  [5]): 


Figure  1:  Schematic  of  a  dielectric  waveguide,  homogeneous 
in  the  z  direction  and  having  an  arbitrary  electrical  permit¬ 
tivity  profile  c(x,y)  in  the  x  —  y  plane. 


V\Ht  +  k%e(x,  y)Ht  +  ^  [Vte(x,  y )  x 

x  (Vt  x  Ht)]  =  0*Ht 


(1) 


where  V?(-)  =  (-),  Ht{x,y)  =  (Hx{x,y),Hy(x,y))  is  the  transverse  magnetic  field,  k0  is  the  wavenumber 

in  the  free  sp&cT,\(x,y)  is  the  permittivity  profile  in  the  *  -  y  plane  and  (3  is  a  propagation  constant.  In 
mathematical  terms  the  above  problem  may  be  viewed  as  an  eigenproblem  of  the  linear  operator  T  (given  by  the 
left-hand  side  of  equation  (1)):  _  .  . 

T  Ht  =  (?Ht  (2) 

with  the  transverse  magnetic  field  Ht  as  an  eigenfunction  and  /?2  as  an  eigenvalue  to  be  found  With  T  being 
a  second-order  non-symmetric  operator,  problem  (2)  provides  a  challenging  computational  task.  In  the  folbwing 
section  we  shall  present  an  method  of  projecting  this  operator  onto  a  finite-dimensional  space  which  allows  efficient 
parallelization  and  reduction  of  numerical  complexity  of  the  solver  based  on  Implicitly  Restarted  Arnoldi  Method. 

2  Cost-reducing  implicit  operator  projection 

As  already  mentioned,  in  order  to  make  use  of  the  Implicitly  Restarted  Arnoldi  Method  to  solving  boundary 
value  problem  (2)  one  has  to  apply  a  projection  of  the  initial  infinite-dimensional  operator  T  onto  a  finite¬ 
dimensional  space  to  obtain  an  approximate  finite-dimensional  operator  Z-  An  efficient  cost-reducing  projection 
method  emerges  if  one  applies  the  Method  of  Moments  approach  together  with  implicit  representation  of  the  finite- 
dimensional  operator  matrix.  If  a  complete  orthonormal  set  of  functions  {h^  m  the  domain  of  T  is  considered 
as  a  set  of  both  basis  and  testing  functions,  then  functions  from  the  operator’s  domain  may  be  represented  in  terms 
of  the  following  Fourier  expansions:  ^ 

u  =  ^(u,hi)/ii  (3) 

i=l 

Truncating  the  sequence  of  Fourier  coefficients  {/<}£ i  =  i  to  a  finite  number  of  terms  gives  a  finite 

representation  of  the  function  u: 


u=[{uM)Au,h2),---,{u>hn))J 


(4) 


The  method  of  discretization  of  operator  T  immediately  follows  from  the  above  representation  of  the  functions. 
Defining  the  elements  of  the  n  x  n  matrix  Z  —  [cij]?j=i  as. 


<Hj  =  (T  hjM) 


(5) 


one  obtains  a  finite-dimensional  linear  operator  being  a  mapping  of  operator  T.  One  may  note  that  if  the  elements 
c*  are  computed  explicitly,  a  dense  matrix,  known  from  the  Galerkin  method,  is  obtained.  This  brings  a  senes 
of  negative  effects.  Firstly,  the  matrix  storage  may  involve  n2  memory  locations  and  secondly  the  matrix-vect 
product  will  normally  involve  0(n2)  operations  which  may  cause  the  computation  time  in  the  methods  which 
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Figure  2:  Schematic  of  parallel  data  distribution  in  matrix-vector  product  design  for  the  implicitly  projected  operator  T 
and  transverse  magnetic  field  Ht  represented  by  2D  Fourier  expansions.  The  dashed  lines  mark  the  block  data  distribution 
pattern  across  the  processors. 


are  based  on  this  explicit  representation  (e.g.  Galerkin  Method)  to  blow  up  for  the  increasing  problem  size  n. 
Nevertheless  a  different  strategy  may  be  applied  based  on  the  following  property  of  operator  £ : 


Tu  = 


£(T  fjhj,h1),--,'E(Tfjhj,hn) 
i  J 


[(Tit, hi) (Tu, /in)] 


(6) 


where  u  =  /t^i-  In  view  °f  the  above,  the  matrix- vector  product  T_u  may  be  computed  using  the  following 

scheme: 


1.  Step  A:  Find  u  =  /i^i> 

2.  Step  B:  Compute  a  =  Tu, 

3.  Step  C:  Compute  (a, hm ),  m  =  1, . . .  ,n. 

As  one  may  note,  Step  B  will  generally  involve  computing  an  approximate  value  of  a.  One  of  the  possible  methods 
of  calculating  a,  applied  in  the  presented  solver,  is  based  on  sampling  of  function  u  at  N  grid-points  (n  <  N)  and 
replacing  differential  operations  on  field  components  (cf.  (1))  with  finite  difference  operations  on  a  discrete  field.  It 
is  also  clear  from  the  above  scheme  that  computation  of  the  matrix-vector  product  Tu  involves  in  fact  computation 
of  inner  products,  which  allows  one  not  to  store  the  elements  of  matrix  £  explicitly.  Still,  although  the  method 
of  computing  matrix-vector  product  presented  above  is  inevitably  more  complex  than  e.g.  in  case  of  a  simple  FD 
scheme,  the  gains  from  using  this  new  approach  become  apparent  if  one  considers  the  following  situation. 


If  the  boundary  value  problem  ( 1)  is  considered  on  a  rectangular  region  containing  a  cross-section  of  the  waveguide 
(cf.  Figure  1)  then  the  set  of  appropriate  trigonometric  functions  forms  an  orthonormal  basis  in  the  domain  of 
operator  T  and  may  be  used  to  represent  e.g.  transverse  magnetic  field  Ht .  Consequently,  the  proposed  scheme  of 
computing  matrix-vector  product  (illustrated  in  Figure  2)  consists  now  of  the  following  steps: 
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Figure  3:  Idea  of  the  parallel  backward  two-dimensional  FFT  algorithm  design.  Performing  a  parallel  forward  2D  FFT  is 
analogous. 


1.  Using  Fourier  coefficients  {/{}£=!  compute  the  values  of  the  function  u  at  the  discrete  grid  points  from  the 
spatial  domain  by  performing  a  two-dimensional  backward  discrete  FFT. 

2.  Calculate  the  values  of  the  Tit  function  at  the  grid  points  of  the  spatial  domain  using  the  previously  calculated 
values  of  u. 

3.  Compute  the  inner  products  (Ti2,  hi)  by  performing  a  two-dimensional  forward  discrete  FFT. 


3  Parallel  implementation  of  the  matrix-vector  product 

The  parallel  implementation  of  the  matrix-vector  product  based  on  implicit  operator  projection  and  2D  Fast  Fourier 
Transforms  applies  a  straightforward  parallel  block  data  decomposition  shown  in  Figure  2.  This  design  may  be 
used  together  with  IRAM  algorithm  as  to  obtain  an  efficient,  load-balanced  parallel  solver.  The  main  difficulty 
with  computing  the  discussed  matrix-vector  product  in  parallel  lies  in  performing  parallel  2D  FFTs.  Figure  3 
shows  a  schematic  of  parallel  design  of  a  backward  2D  FFT.  One  may  note  that  in  order  to  compute  Fast  Fourier 
Transform  in  the  y  direction  a  parallel  transposition  of  a  distributed  matrix  has  to  be  performed.  Referring  to 
distributed  memory  systems,  this  operation  involves  mainly  inter-processor  communication  which  in  the  proposed 
implementation  is  based  on  high  level  collective  communication  MPI  routine  MPI JUltoallvC . ) .  The  parallel 
design  of  forward  2D  FFT  is  analogous,  except  that  the  Fourier  Transform  in  the  y  direction  is  performed  first 
in  order  to  avoid  costly  matrix  transposition  while  computing  pairs  of  backward  and  forward  Fourier  transforms 
which  takes  place  during  calculation  of  the  matrix-vector  product. 

3.1  Computational  and  memory  cost 

The  numerical  cost  of  the  proposed  method  of  computing  matrix-vector  product  is  primarily  determined  by  the 
numerical  complexity  of  Steps  1  and  3  and  equals  0(K  log2  K)  for  a  single-processor  execution,  where  K  is  a  product 
of  the  FFT  lengths  in  x  and  y  directions.  At  this  point  it  should  be  noted  that  the  problem  size  N  is  usually 
considerably  smaller  than  K,  e.g.  by  25  times.  This  is  due  to  oversampling  which  should  be  applied  in  order  to 
obtain  more  accurate  values  of  inner  products  computed  using  FFT.  If  the  discussed  projection  method  is  compared 
to  Finite  Difference  discretization  it  is  apparent  that  computing  matrix-vector  product  requires  significantly  more 
operations  in  case  of  the  FFT-based  algorithm.  Still,  one  should  be  aware  that  the  oversampling  is  also  needed 
in  case  of  the  FD  discretization  due  to  the  requirements  of  numerical  dispersion  and  causes  an  increment  in  the 
problem  size  N.  This  does  not  apply  to  the  proposed  implicit  projection  of  the  operator  and  consequently  the 
extra  time  spent  on  calculating  the  matrix-vector  products  is  then  regained  by  spending  less  time  on  solving  an 
eigenproblem  with  a  reduced  size.  The  memory  requirements  of  computing  matrix-vector  product  involving  2D 
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FFTs  are  also  relatively  small.  Taking  into  account  the  memory  necessary  to  store  the  samples  of  the  function 
Ht  =  ( HX,HV )  in  the  spatial  domain  and  the  workspace  needed  to  perform  Fourier  transforms  the  memory  cost 
equals  roughly  (2 K  +  2N  +  Qy/N)  locations,  which  is  usually  significantly  less  as  compared  to  the  case  when 
operator  matrix  T  is  stored  explicitly. 


4  Numerical  results 


The  parallel  IRAM-based  solver  (IRAM-FFT)  applying  implicit  operator  projection,  based  on  the  public  domain 
library  P.ARPACK  [4]  and  MPI  implementation  of  the  matrix- vector  product  has  been  used  to  find  modes  in 
dielectric  waveguides  described  by  equation  (1).  In  the  numerical  tests  we  have  computed  propagation  constants 
for  an  elliptical  guide  with  the  semi-axes  ratio  ax/ay  =  2/1  and  a  continuous  permittivity  profile  given  by  the 
function: 


e(x,  y)  =  eo  1  -  {(x/ax)2  +  (y/ay)2)a/2 


(7) 


An  open  structure  was  modelled  by  taking  the  screening  walls  sufficiently  far  away  from  the  guide  (at  the  distance  of 
lOaj;  from  the  centre  of  the  waveguide).  The  results  presented  in  Table  1  show  a  comparison  of  the  non-dimensional 
normalized  propagation  constants  Z : 


02/H  ~  1 


computed  using  the  IEEM  [6]  and  the  IRAM-FFT  method  for  different  profile  exponents  a  (cf.  (7)).  In  the  tests 
the  normalized  frequency  V  =  ho  ■  2ax  ■  yjt  —  1  (e  =  2.25)  equalled  3,  the  number  of  expansion  functions  equalled 
75  in  x  and  y  directions  and  the  respectful  FFT  lengths  equalled  256.  It  is  apparent  that  the  obtained  results 
agree  almost  perfectly. 

Table  2  presents  a  comparison  of  the  normalized  propaga¬ 
tion  constants  /3/k0  for  a  slab  guide  with  discontinuous  per¬ 
mittivity  profile  (dimensions:  21.0  x  16.0  mm;  slab  located 
symmetrically  (width:  4  mm,  height:  16  mm);  core  permit¬ 
tivity:  e  =  2.56;  frequency:  15  GHz)  computed  with  IRAM- 
FFT,  Transverse  Resonance  Method  (TRM)  and  the  Galerkin 
Method  (GM)  with  QR-based  solver  [8].  In  the  tests  the  num¬ 


ber  of  expansion  functions  equalled  75  in  x  and  y  directions 
and  the  respectful  FFT  lengths  equalled  256.  The  results  show 
that  the  proposed  algorithm  may  also  be  applied  to  finding  fun¬ 
damental  and  higher-order  modes  in  guiding  structures  with 


Table  1:  Comparison  of  computed  normalized  prop¬ 
agation  constants  Z  for  an  elliptical  guide  with  e(z,  y) 
given  by  (8). 


discontinuous  permittivity  profiles. 


Apart  from  the  tests  validating  the  proposed  method  (IRAM-FFT)  also 
parallel  performance  tests  have  been  carried  out  in  a  Cray  T3E  scalable 
distributed  memory  system.  Figures  4  and  5  show  speed-ups  in  execu¬ 
tion  time  of  computing  a  pair  of  2D  FFTs  and  the  total  execution  time 
of  an  iteration  of  the  IRAM  algorithm.  As  one  may  note  the  speed-up 
improves  with  the  increasing  number  of  expansion  terms  and  FFT  lengths 
and  almost  a  perfect  speed-up  of  computations  is  obtained  for  the  available 
number  of  processors  as  N  =  2562  and  K  =  10242.  Figure  3  also  shows  the  Table  2:  Comparison  of  the  propagation 
performance  of  the  parallel  solver  (IRAM-FD)  based  on  IRAM  and  Finite  constants  computed  for  a  slab  guide 
Difference  mapping  of  operator  T  with  discretization  grid  200  x  100.  Fig-  wth  a  discontinuous  permittivlty  ProfiIe- 
ure  6  shows  a  comparison  of  single-processor  execution  times  of  IRAM-FD  and  IRAM-FFT  solvers  for  a  different 
number  of  eigenvalues  to  be  found.  In  case  of  the  IRAM-FFT  algorithm  the  number  of  expansion  functions  used 
to  represent  the  fields  equalled  40  in  every  spatial  direction  (IV  =  402).  For  IRAM-FD  algorithm  the  discretization 
grid  equalled  200  x  100.  One  may  note  that  if  FFT  lengths  equal  128  and  256  (in  every  spatial  direction)  the  IRAM- 
FFT  method  runs  faster  than  the  FD-based  solver.  As  the  FFT  length  increases  computation  of  the  matrix- vector 
product  becomes  more  costly  for  the  IRAM-FFT  algorithm,  causing  an  increment  of  the  total  execution  time. 
Still,  in  case  of  this  algorithm,  one  may  note  that  for  increasing  number  of  eigenvalues  to  be  found  the  growth  in 
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Figure  4:  Speed-up  in  the  execution  time  of  a  pair  of  operations:  a  backward  2D  FFT  and  a  forward  2D  FFT  as  a  function 
of  the  number  of  processors  applied,  for  different  numbers  of  expansion  functions  and  FFT  lengths.  The  tests  were  performed 
in  the  Cray  T3E  system. 


execution  time  is  slower  than  for  IRAM-FD  solver,  due  to  quicker  convergence  of  the  Implicitly  Restarted  Arnoldi 
Method  while  using  the  implicit  projection  of  the  operator. 


5  Conclusions 

It  has  been  found  that  the  proposed  method  of  solving  boundary  value  problems  based  on  Implicitly  Restarted 
Arnoldi  Method  and  implicit  operator  projection  may  be  implemented  in  an  efficient  scalable  numerical  solver 
which  can  be  applied  to  solving  large-scale  operator  problems  in  parallel  distributed  memory  systems.  It  has 
been  shown  that  the  scalability  of  the  solver  is  comparable  to  the  scalability  of  the  solver  based  on  the  Finite 
Difference  (FDFD)  discretization.  Still,  due  to  a  reduced  size  of  the  problem  generated  in  the  proposed  projection 
technique  a  fast  convergence  of  IRAM  is  obtained  and  memory  requirements  of  the  method  are  reduced.  Referring 
to  electromagnetic  applications,  the  numerical  tests  show  that  the  proposed  method  may  be  successfully  applied  to 
finding  modes  (including  higher  modes)  in  waveguides  with  both  continuous  and  discontinuous  permittivity  profiles. 
Clearly,  although  in  the  paper  the  discussion  concentrated  on  investigating  operators  arising  in  electromagnetics, 
the  presented  technique  may  be  applied  to  solving  eigenproblems  in  a  more  general  class  of  non-symmetnc  infinite- 
dimensional  operators. 
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Figure  5:  Speed-up  in  the  total  execution  time  of  a  single  iteration  of  the  IRAM-FFT  parallel  solver  vs.  the  number  of 
processors  for  different  numbers  of  expansion  functions  and  FFT  lengths.  The  tests  have  been  performed  in  the  Cray  T3E 
system. 


Number  of  eigenvalues  to  be  found 


Figure  6:  Comparison  of  single-processor  execution  times  of  IRAM-FD  and  IRAM-FFT  solvers  for  different  number  of 
eigenvalues  to  be  found  and  different  FFT  lengths  applied.  The  tests  have  been  performed  in  the  Cray  T3E  system. 
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Abstract 

We  describe  an  FD-TD  simulation  of  a  SAAB  2000  aircraft  using  more  than  one  billion 
cells.  This  was  performed  on  125  nodes  of  an  IBM  SP.  We  also  give  a  speed-up  result  for  the 
IBM  SP  and  discuss  the  possibilities  for  visualization  of  FD-TD  solutions  in  a  VR-CAVE. 


1  Introduction 

The  Swedish  center  of  excellence  Parallel  and  Scientific  Computing  Institute  (PSCI)  [6]  conducts 
a  research  project  within  its  CEM  program  called  “Large  Scale  FD-TD”  [5].  Within  this  project 
a  3D  FD-TD  code,  pscyee,  has  been  implemented  using  Fortran  90. 

The  capabilities  of  the  sequential  version  of  pscyee  includes  Huygens’  surfaces,  PML  according 
to  Berenger  as  absorbing  boundary  condition  (ABC),  a  generalized  thin  wire  subcell  model  [4] 
based  on  the  approach  of  Holland-Simpson  and  PEC  surfaces.  Excitation  may  also  be  performed 
with  current /voltage  sources  in  the  thin  wire  model  and  with  point  sources.  The  first  order  Mur 
ABC  is  available  for  calculations  where  the  accuracy  demand  on  the  ABC  is  not  so  high. 

Implementation  of  models  for  frequency  dispersive  materials  is  ongoing.  It  has  already  been 
implemented  in  a  2D  version  of  the  code,  pscyee  will  be  used  as  a  component  in  a  hybrid  time  do¬ 
main  solver  which  will  combine  the  FD-TD  scheme  with  an  implicit  Finite-Element  Time-Domain 
(FE-TD)  scheme  and  an  explicit  Finite- Volume  Time-Domain  (FV-TD)  scheme  for  unstructured 
meshes.  This  will  be  performed  within  the  GEMS  project  which  is  a  collaborative  code  develop¬ 
ment  project  between  Swedish  industry  and  academia  involving  PSCI,  Ericsson  Saab  Avionics, 
Ericsson  Microwave  System,  Saab  Ericsson  Space,  the  Swedish  Institute  of  Applied  Mathematics 
and  the  Swedish  Defense  Research  Establishment. 


2  Parallel  implementation 

The  parallel  implementation  of  pscyee  uses  the  Message  Passing  Interface  (MPI)  standard.  This 
was  chosen  to  guarantee  portability  of  the  code. 

Figure  1  illustrates  our  parallelization  strategy  in  ID  with  six  cells  on  two  nodes.  When 
node  two  calculates  the  EX4- value  it  needs  to  know  the  value  of  Hz3.5  which  is  stored  in  node 
one  Similarlv.  when  node  one  calculates  the  Hz3.5-value  it  needs  to  know  the  value  of  EX4 
which  is  stored  in  node  two.  This  means  that  during  each  time  step  two  messages  have  to  be 
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Figure  1:  Illustration  of  our  parallelization  strategy.  A  ID  Yee  grid  with  six  cells  (n=6)  and 
A y  =  0.5  distributed  on  two  nodes. 

sent,  one  in  each  direction.  In  3D  two  of  the  six  electromagnetic  variables  must  be  sent  across 
processor  interfaces.  We  have  implemented  this  strategy  with  several  syntactically  different  MPI 
implementations.  They  are  described  in  detail  in  [1]. 

On  a  parallel  computer  with  reasonably  fast  communication,  e.g.  an  IBM  SP,  it  is  possible 
to  achieve  perfect  scalability.  With  perfect  scalability  we  mean  that  when  the  problem  size  is 
increased  linearly  with  the  number  of  processors  the  execution  time  is  constant.  On  the  other 
hand,  it  is  usually  not  possible  to  achieve  ideal  speed-up.  An  exception  occurs  when  cache  effects 
makes  smaller  problem  sizes  faster  than  larger  ones.  With  speed-up  we  mean  that  the  problem 
size  is  kept  fixed  independently  of  the  number  of  processors.  Figure  2  displays  speed-up  results 
for  an  IBM  SP.  A  point  source  was  used  for  excitation  and  the  first  order  Mur  scheme  was  used 
as  ABC.  The  non  blocking  communication  of  MPI  (MPI.ISEND)  was  used. 

Portability  has  been  demonstrated  by  executing  the  code  on  several  different  parallel  comput¬ 
ers,  including  an  IBM  SP,  a  Fujitsu  VX,  a  CRAY  J932  and  a  cluster  of  Digital  Alpha  servers.  The 
performance  was  satisfactory  on  all  these  computers. 

A  more  detailed  descriptions  of  our  parallelization  strategy  can  be  found  in  [2,  1].  A  nice 
review  of  parallelization  of  FD-TD  is  given  by  Professor  Gedney  in  Chapter  5  of  [3]. 


3  One  billion  cells 

For  homogeneous  materials  we  need  only  six  floating  point  values  per  cell,  the  three  electric  and 
the  three  magnetic  field  components.  Using  32  bit  precision  (four  bytes)  means  that  we  need 

4  *  6  *  109  =  22.4  Gbyte  memory  for  one  billion  cells.  Clearly,  only  a  parallel  computer  can  supply 
this.  We  used  125  nodes  with  160MHz  RS/6000  processors  of  the  IBM  SP  at  PDC,  KTH.  These 
nodes  have  256  Mbyte  memory  each  making  a  total  of  31.25  Gbyte.  Actually,  a  few  of  them  have 
more  than  256  Mbyte,  but  we  do  not  want  to  use  it  since  that  would  destroy  the  load  balancing. 
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#  nodes 

Figure  2:  Speed-up  of  pscyee  on  an  IBM  SP  with  160MHz  RS/6000  processors.  The  problem  size 
is  252  x  252  x  127.  The  solid  line  represents  ideal  speed-up. 


It  is  not  possible  to  use  all  the  physical  memory  on  a  node  since  some  of  the  memory  is  used 
by  the  operating  system.  Using  too  much  memory  will  result  in  swapping  which  must  be  avoided 
since  it  has  a  drastic  effect  on  the  performance.  Tests  indicated  that  it  was  safe  to  use  up  to  200 
Mbyte  on  each  node  and  that  one  usually  could  use  up  to  220  Mbyte. 

The  object  chosen  for  this  calculation  was  a  SAAB  2000  aircraft.  A  Cartesian  description  of 
this  aircraft  was  created  from  a  CAD  description  using  CADFIX.  This  was  performed  by  Ericsson 
Saab  Avionics  (ESB).  The  file  delivered  from  ESB  only  contained  one  half  of  the  airplane  so  we 
had  to  create  the  entire  airplane  by  mirroring.  Hence,  we  got  an  absolutely  symmetric  airplane. 
The  resolution  was  2.5  cm  in  all  three  dimensions  which  was  a  factor  two  per  space  dimension 
smaller  than  the  resolution  previously  used  at  ESB.  We  used  a  problem  size  of  1260  x  1260  x  635 
equaling  1 008 126000  cells  in  total.  The  computational  domain  was  split  in  5  x  5  x  5  blocks  each 
with  a  size  of  252  x  252  x  127  cells. 

The  total  number  of  E-fields  on  the  surface  of  the  SAAB  2000  was  almost  two  millions  and  the 
number  of  surface  quads  was  almost  one  million.  The  memory  needed  to  store  this  information 
varied  from  node  to  node  and  was  at  most  nine  Mbytes. 

The  input  ASCII-file  containing  the  PEC  information  for  one  half  of  the  SAAB  2000  contained 
almost  one  million  lines.  Since  all  nodes  read  this  it  was  very  inefficient  to  use  the  standard  file 
system,  af  s,  which  only  gave  a  CPU  activity  of  2-3%  while  reading  the  file.  Instead  we  used  the 
parallel  file  system,  pf  s,  which  gave  a  CPU  activity  of  about  20%.  A  more  efficient  strategy  would 
probably  be  to  let  one  node  read  the  data,  analyze  it  and  then  distribute  it  to  the  other  nodes. 

The  performance  of  the  core  of  the  code  is  almost  25  Gflop/s.  This  figure  excludes  output. 
When  the  surface  currents  and  two  cutting  planes  were  written  every  20th  time  step  this  per¬ 
formance  dropped  to  about  20  Gflop/s.  In  this  case,  the  time  to  complete  one  time  step  was 
about  1.8  seconds.  A  total  of  2500  time  steps  were  taken  and  the  total  execution  time,  including 

initializations,  was  86  minutes.  . 

The  first  order  Mur  ABC  was  used  and  excitation  was  performed  with  a  point  source  in  front 
of  the  airplane.  The  MPI-SENDRECV  facility  was  used  for  the  communication. 
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Figure  3:  The  surface  currents  after  1500  time  steps  on  the  SAAB  2000  aircraft.  Also  the  magni¬ 
tude  of  the  H-field  is  shown  on  a  cutting  plane  across  the  wings  perpendicular  to  the  fuselage. 

4  Visualization  of  large  FD-TD  data 

Running  large  FD-TD  simulations  do  not  only  require  powerful  computers.  It  also  put  high 
demands  on  the  post  processors.  For  small  and  medium  size  problems  you  can  save  the  whole 
electromagnetic  field  at  each  time  step  (or  every  n:th  time  step)  if  you  have  sufficiently  large  discs. 
After  the  simulation  you  can  go  through  the  data  and  visualize  the  features  you  are  looking  for 
and  also  find  unexpected  properties  of  your  solution.  This  is  not  the  case  when  you  are  solving 
large  problems.  Not  only  the  disc  space  is  limited  but  the  I/O  bandwidth  is  also  an  effective 
bottleneck.  You  have  to  decide  a  priori  what  field  values  you  want  to  post  process  and  save  only 
them.  Furthermore,  you  need  a  high  end  graphical  system  to  visualize  the  multitude  of  polygons 
that  constitute  the  objects  in  your  simulation. 

A  new  technique  has  emerged  during  the  last  few  years  which  is  believed  to  make  the  un¬ 
derstanding  of  scientific  computing  results  easier.  The  concept  is  usually  called  “CAVE” ,  CAVE 
Automated  Virtual  Environment,  and  consist  of  back-projection  of  images  onto  semi  transparent 
surfaces.  If  several  surfaces  are  put  together  you  get  a  room  where  you  are  surrounded  by  the 
images.  By  adding  a  tracking  system  where  your  head  position  is  tracked,  stereo  images  can  be 
produced  and  highly  realistic  3D  environments  are  perceived.  With  a  tracking  system  for  a  hand 
held  device  you  can  also  interact  with  this  virtual  reality. 

This  technique  has  obvious  benefits:  it  is  easy  for  several  people  to  be  in  the  room  simulta¬ 
neously  and  therefore  see  images  together.  The  users  can  thus  interact  with  the  virtual  reality 
together  and  focus  on  interesting  areas.  Also,  because  users  see  their  own  hands  and  feet,  for 
example,  as  part  of  the  virtual  world,  they  get  a  heightened  sense  of  being  inside  that  world. 

Various  CAVE-like  environments  exist  all  over  the  world  today.  Most  of  these  have  up  to  four 


575 


Figure  4:  The  interior  of  the  SAAB  2000  aircraft.  Surface  currents  are  shown  at  the  same  time  as 
in  Figure  3. 

projection  surfaces;  images  are  usually  projected  on  three  walls  and  the  floor.  Adding  projection 
on  the  ceiling  gives  a  fuller  sense  of  being  enclosed  in  the  virtual  world. 

Projection  on  all  six  surfaces  of  a  room  allows  users  to  turn  around  and  look  m  all  directions. 
Thus  their  perception  and  experience  are  not  limited,  which  is  necessary  for  full  immersion. 
Such  a  six-surface-system  has  recently  been  inaugurated  at  PDC,  KTH  and  several  projects  on 
visualizing  CEM  solutions  have  been  started  where  the  users  will  be  able  to  navigate,  for  examp  e 
inside  an  aircraft  while  lightning  strikes.  In  this  case  one  will  directly  see  the  field  penetrating 
the  openings  of  the  fuselage  and  detect  “hotspots”  to  avoid  in  the  context  of  EMC. 

However,  the  CAVE  technology  does  not  ease  the  urge  of  effective  handling  of  the  output 
from  FD-TD  solvers.  Even  though  the  computers  serving  CAVE  environments  often  are  high  end 
graphical  systems  you  still  have  to  limit  the  data  saved  for  post  processing.  In  Figure  3  the 
surface  currents  are  displayed  on  each  2.5  x  2.5  cm2  square  constituting  the  surface  of  the  FD-TD 
object.  Approximately  one  million  quads  are  put  to  the  visual  system  and  clearly,  most  of  them 
are  not  visible  in  the  picture.  Furthermore,  perhaps  one  could  utilize  the  concept  “level  of  details 
where  smaller  parts  in  the  background  are  combined  to  fewer  objects  and  thus  lower  the  number 
of  polygons  to  be  rendered.  For  volumetric  data  semitransparent  3D  texture  mapping  can  be 
utilized.  This  volume  rendering  technique  can  be  used  to  show  the  field  inside  the  aircraft. 


5  Conclusions 

We  have  demonstrated  that  by  using  a  large  parallel  computer  it  is  possible  to  do  an  FD-TD 
calculation  with  more  than  one  billion  cells.  We  showed  this  by  using  125  nodes  of  an  IBM  SP. 

The  parallel  implementations  of  PML  and  the  subcell  model  for  arbitrarily  oriented  thm 
wires  [4]  are  in  progress.  When  they  are  completed  we  plan  to  repeat  the  one  billion  cells  cal¬ 
culation  using  these  features.  Excitation  will  be  performed  using  the  thin  wire  model.  The 
parallelization  approach  is  illustrated  in  Figure  5,  where  models  for  frequency  dispersive  materials 
are  also  included. 


Load  Balancing 


node 


P  =  PML,  L  =  Leap  frog,  W  =  Wire,  F  =  Freq.  disp.  mat. 

Figure  5:  The  load  balancing  for  a  full  problem,  including  PML,  frequency  dispersive  materials 
and  an  arbitrary  oriented  thin  wire  subcell  model. 
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Abstract.  To  improve  finite  difference  calculations,  the  use  of  higher  order  difference  operators  is  promis¬ 
ing.  For  the  time-stepping,  usually  only  the  second  order  centred  differencing  is  used.  Since  this  leads 
only  to  second  order  accurate  schemes,  independent  of  the  choice  of  the  difference  operator,  we  investigate 
classical  explicit  higher  order  multistep  methods.  Unfortunately,  they  do  not  satisfy  completely  the  needs 
for  the  use  with  Maxwell’s  or  telegraph  equations.  Therefore,  we  construct  new  explicit  multistep  schemes, 
which  show  stability  limits  comparable  to  centred  differencing  and  are  more  adapted  to  our  purposes.  This 
paper  discusses  the  construction  principles  of  the  new  schemes,  gives  the  stability  limits  as  well  as  the 
convergence  orders  and  error  constants,  and  validates  the  approach  by  applying  the  schemes  to  telegraph 
equations  using  several  difference  operators  for  the  approximation  of  the  spatial  derivatives. 


1.  Introduction 

The  development  of  higher  order  schemes  promises  algorithms  with  smaller  errors  than  standard  algo¬ 
rithms  but  comparable  computational  effort.  For  time  domain  calculations,  usually  centred  differences 
are  used  to  approximate  the  temporal  derivatives,  e.g.  [4]. 

In  [1],  we  also  applied  centred  differences  for  the  time-stepping  in  our  algorithm  using  biorthogonal 
B-spline- wavelets.  Jameson  pointed  out,  that  the  use  of  spline-wavelets  yields  higher  order  difference 
operators  [5].  Since  we  did  not  find  a  dramatic  reduction  in  the  error,  as  we  used  higher  and  higher  order 
difference  operators,  we  carried  out  a  dispersion  analysis  which  we  extended  to  the  use  of  general  linear 
multistep  methods  for  the  time-stepping  procedure  [2].  From  the  results  of  [2],  we  were  able  to  explain 
our  experience  made  in  [1].  We  concluded,  that  the  development  of  higher  order  temporal  integration 
schemes  is  necessary  to  exploit  higher  order  spatial  differencing  as  much  as  possible. 

In  the  subsequent  sections,  we  show  that  the  classical  explicit  multistep  schemes  like  Adams-Bashforth 
or  Nystrom  are  not  really  satisfactory  for  electromagnetics  since  there  must  be  no  amplification  or 
damping  of  travelling  waves.  Furthermore,  based  on  the  dispersion  analysis  of  general  linear  multistep 
methods,  we  show  how  to  construct  linear  multistep  methods,  that  do  not  amplify  or  damp  the  waves, 
give  their  convergence  order,  their  error  constants  as  well  as  their  stability  limits.  Numerical  experiments, 
using  our  new  time-stepping  schemes,  demonstrate  their  superiority  over  centred  differencing. 

2.  Multistep  Schemes 

If  we  discretize  spatially  a  time-dependent  partial  differential  equation,  e.g.  Maxwell’s  or  telegraph 
equations,  we  are  left  with  a  system  of  ordinary  differential  equations  which  has  to  be  integrated.  Ob¬ 
viously,  one  could  apply  standard  methods  for  solving  systems  of  ordinary  differential  equations.  This 
point  of  view  is  generally  known  as  method  of  lines. 

We  concentrate  on  linear  multistep  methods.  Their  exact  definition  can  be  found  e.g.  in  [6].  If  a  linear 
multistep  method  is  applied  to  telegraph  equations 
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which  are  spatially  discretized  and  the  spatial  derivatives  are  replaced  by  a  difference  operator  D  = 
Pn£>x  {$x  is  the  shift  operator  with  respect  to  x,  e.g.:  SxUk,i  —  ujy+i),  we  end  up  in  the 
algorithm  [2] 


<*L,k,  &L,k  £  R  ,  <*L,l  i=  0 ,  |Qi,o|  +  |/?L,o!  7*  0. 


m  denotes  the  time-step  and  l  the  point  in  the  spatial  grid.  For  explicit  schemes,  /?£,,£  =  0,  and  we  have 
to  know  umj  and  im;i  for  m  =  0, 1, . . .  ,  L  —  1  and  all  l  (so-called  starting  values)  to  calculate  ul,i  and 
*L,J- 

The  dispersion  relation  for  this  reads 


(3) 


£  otL,kejnk  m2 

*=? - =  ±cAt  J2 

£  0L,kejnk  n=-Ml 

k= 0 


where  ft  is  the  normalized  frequency  and  77  the  normalized  spatial  frequency. 

From  the  dispersion  relation  (3),  one  can  calculate  the  behaviour  of  the  numerical  modes  and  compare 
them  to  the  physical  ones.  Since  we  use  multistepping,  of  course  we  have  2 L  numerical  instead  of  2 
physical  modes.  Following  [7],  we  calculate  ft;,  i  =  1, . . .  ,  2L  as  function  of  77  for  all  2 L  numerical  modes. 
Requiring  that  all  ft;  must  not  have  negative  imaginary  parts  — a  negative  imaginary  part  would  mean 
exponential  growth —  allows  to  calculate  the  stability  limit.  The  evaluation  of 


(4) 


Cm  — 


AzRe{ftfc} 

AU) 


yields  the  numerical  phase  velocity  Cm  and  can  be  compared  to  the  physical  one  c.  Note  that  for  this 
comparison  to  make  sense,  k  has  to  be  chosen  correctly,  i.e.  we  have  to  choose  the  numerical  mode  that 
converges  to  the  physical  mode  and  not  one  of  the  purely  numerical  ones. 

Since  the  extraction  of  ft;  in  general  can  not  be  done  analytically,  the  calculation  of  the  modes,  the 
stability  limits  and  the  phase  velocities  is  carried  out  numerically. 

First  of  all,  we  looked  at  the  classical  explicit  multistep  methods,  i.e.  Adams-Bashforth  and  Nystrom 
type  schemes.  Nystrom’s  methods,  except  for  the  case  L  —  2  (centred  differencing),  are  unstable.  To 
achieve  imaginary  parts  of  ft;  greater  than  — 10-14,  one  has  to  use  inacceptably  small  time  steps  (remind 
that  these  investigations  have  been  done  numerically,  i.e.  we  have  to  choose  zero  a  little  bit  smaller  than 
0!). 

As  stability  limit,  we  give  7,  so  that  in  case  of  centred  differences  as  approximation  for  the  spatial 
derivative  At  and  Ax  must  fulfill  the  inequality 


(5) 


cAt 


If  one  uses  another  difference  operator,  one  has  only  to  calculate  the  maximum  value  of  the  imaginary 
part  on  the  right  hand  side  of  (3)  for  77  €  [— 7r,  7r]  and  divide  7  by  this  value,  e.g.  for  the  fourth  order 
difference  operator  given  below  one  has  to  multiply  the  stability  limit  7  by  0.7287. 

Additionally,  since  all  modes  show  an  imaginary  part  of  ft;,  we  give  the  maximum  value  of  the  imagi¬ 
nary  part  of  the  mode  that  corresponds  to  the  physical  mode  including  the  sign.  A  mode  gets  multiplied 
by  the  factor  exp(—at/At).  So  a  =  10-3  means,  that  after  103  time  steps  the  wave  is  damped  by 
e.  Similarly,  a  =  10“ 14  means,  that  after  108  time  steps  the  wave  is  amplified  by  exp(10-6),  i.e.  the 
amplitude  error  is  in  the  order  of  10-6. 
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Table  1.  Stability  limits  necessary  to  achieve  imaginary  parts  ^  -10  14  with  Adams- 
Bashforth  schemes.  The  maximum  imaginary  part  (including  sign)  of  the  physical  mode 
is  also  given. 


L 

1 

2 

3 

4 

5 

7 

1.421  •  10“7 

4.479  •  10~4 

0.7236 

0.4299  .0.005601 

a 

-10-14 

-lo-14 

0.135 

0.0028 

— 10-14 

L 

6 

7 

8 

9 

10 

7 

0.01831 

0.05808 

0.02948 

0.01496  0.007587 

a 

-10-14 

3.8  •  10"11 

io-14 

-10“14  7  - 10“ 15 

Table  1  gives  the  results  for  the  Adams-Bashforth  schemes.1  . 

£  =  l  is  unstable  and  leads  to  an  inacceptable  time  step.  L  =  2  leads  also  to  an  macceptable  time 
step,  since  it  is  only  second  order  accurate  and  centred  differencing  (7  =  1.0)  performs  much  better. 
The  cases  L  =  3  and  L  =  4  allow  good  time  steps,  but  the  schemes  are  dissipative.  The  cases  L  ?  •  & 
show  extremely  slight  amplification  or  damping,  so  this  would  be  acceptable,  but  the  time  steps  are  quite 
small. 


3.  Idea  of  Construction 

The  explanation  for  this  behaviour  is  simple  and  leads  directly  to  the  construction  of  a  new  family  of 
explicit  and  implicit  schemes  which  are  more  adapted  to  the  use  with  telegraph  and  Maxwell  s  equations. 
All  difference  operators  we  normally  use  are  antisymmetric,  e.g. 

•  Centred  differences:  D  =  2Sj(^  ~  S-cl) 

•  4th  order  differences:  D  =  +  8*5*  —  +  Sx  ) 

this  means,  that  the  right  hand  side  of  the  dispersion  relation  (3)  is  purely  imaginary  for  all  n  €  [-*,*]. 

We  observed,  that  only  for  Nystrom  with  L  =  2,  the  left  hand  side  is  also  purely  imaginary  for  real 
^  6  [-7r,7r].  In  all  other  cases  it  is  not.  Consequently,  U  needs  to  show  some  imaginary  part,  so  that  the 
term  on  the  left  hand  side  of  (3)  also  becomes  purely  imaginary. 

The  idea  for  constructing  the  new  schemes  now  comes  clear.  The  coefficients  aL,i  have  to  be  antisym¬ 
metric 

(6)  &L,L-i  =  -<*£,»  ^  *  €  {0, . . .  ,  L} 

and  the  coefficients  have  to  be  symmetric 

(7)  &L,L~i  =  PLj,  i  €  { 0, . . .  ,  A}  • 

4.  New  Family  of  Schemes 

Of  course  the  choice  of  aL,i  and  (3L,i  is  restricted  (see  [6]  for  details). 

Nevertheless  choosing  aL,t  =  1,  =  -1  and  aLi  =  0,  i  =  1, . . .  ,  i-1  allows  to  construct  comment 

multistep  schemes  which  are  weakly  stable  and  have  the  nice  property  to  have  no  exponentially  growing 
numerical  modes,  if  the  time  step  is  chosen  small  enough,  and  no  damping. 


1  Since  the  calculations  have  been  carried  out  numerically,  we  only  give  the  first  digits. 
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If  L  is  odd,  it  is  not  possible  to  construct  an  explicit  scheme  where  the  are  symmetric  (if  one 
wants  to  have  the  highest  convergence  order  possible) .  Consequently,  these  schemes  exhibit  exponentially 
growing  modes.  Therefore  we  do  not  give  their  coefficients  nor  their  characteristics. 

Table  2  gives  the  coefficients,  and  table  3  the  convergence  orders,  the  error  constants  [6]  and  the 
stability  limits  for  the  new  schemes.  The  stability  limits  given  have  been  verified  in  our  experiments,  i.e. 
increasing  them  by  only  1%  leads  to  rapid  exponential  growth. 

Table  2.  Coefficients  of  the  explicit  schemes. 


Length  L 


Coefficients  •  •  •  ,  Pl,q 


2 


0,2,0 


4 

6 

8 

10 


n  33  21  39  21  33  n 

u>  10’  5  ’  5  ’  5  >  10’u 


a  736  _ 848  _ _ 

u’  189  >  105 »  105  1 


19672  1952  848  736  n 

"  945  ’  105  ’  105  ’  189  ’  u 


4175  41675 
'  324  ’  1143  ’ 


137675  169555 
2268  ’  2268  ’ 


'  324  ’  4536  ’ 


Table  3.  Characteristics  of  the  explicit  schemes. 


Length  L 

Convergence 

Order 

Error 

Constant 

Stability 
Limit  7 

2 

2 

l 

6 

1.0 

4 

4 

7 

90 

0.4330 

6 

6 

41 

840 

0.1143 

8 

8 

989 

28350 

0.02949 

10 

10 

16067 

598752 

0.007587 

We  compare  these  new  schemes  with  the  Adams-Bashforth  cases.  The  case  L  =  2  gives  centred 
differencing.  L  =  4  gives  a  slightly  greater  stability  limit  than  Adams-Bashforth,  but  shows  of  course  no 
damping.  L  =  6  increases  the  possible  time  step  by  a  factor  of  6  and  also  shows  no  damping.  L  =  8  and 
L  —  10  seem  to  have  no  advantages,  since  the  stability  limits  axe  the  same  and  the  Adams-Bashforth’s 
schemes  have  practically  no  imaginary  part. 

A  disadvantage  of  the  new  schemes  is,  that  they  have  greater  memory  requirements  than  Adams- 
Bashforth’s  schemes. 

To  illustrate  the  dispersion  behaviour  of  the  new  schemes,  the  error  of  the  phase  velocity  in  the 
maximum  norm  ||cm|  -  c|  (remind  that  Cm  can  be  positive  or  negative,  depending  in  the  sign  of  the  right 
hand  side  of  (3))  is  given  in  the  plots  in  Fig.  1.  The  plots  show  the  errors  for  all  five  integration  schemes 
given  in  this  paper  for  four  different  difference  operators.  All  calculations  have  been  carried  out  at  the 
stability  limits  given. 

Top  left,  we  used  the  fourth  order  difference  operator  given  above.  If  centred  differencing  (L  —  2)  is 
used,  only  second  order  convergence  can  be  achieved;  if  one  uses  a  more  accurate  integration  scheme,  the 
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FIGURE  1.  Phase  velocity  errors  of  the  new  schemes  for  several  higher  order  (spatial) 


difference  operators. 


Fourth  order  difference  operator 


f\ 


(2,4)-B-splines 


V 


(2,8)-B-splines 


V 


I 


(2,12)-B-splines 


V 


fourth  order  convergence  of  the  difference  operator  can  be  obtained.  Note  that  increasing  the  order  of 
the  integration  scheme  to  more  than  four  does  not  improve  the  convergence  behaviour  any  further. 

The  other  three  plots  show  the  phase  velocity  errors  for  the  difference  schemes  obtained  by  applying  a 
Petrov-Galerkin  procedure  using  the  primal  scaling  functions  of  the  biortogonal  B-spline-wavelets  [3]  as 
expansion  functions  and  the  dual  scaling  functions  as  test  functions  [1].  We  give  results  for  (2,4)-,  (2,8)-, 
and  (2,12)-B-splines.  The  resulting  difference  operators  seem  to  be  of  convergence  orders  6,  10  ,  and 
14  (taken  from  the  behaviour  of  the  error  of  the  phase  velocity).  As  expected,  the  integration  scheme 
L  =  2  leads  to  second  order  convergence,  L  =  4  to  fourth  order.  The  other  three  integrators  can  t  be 
distinguished.  We  believe  that  their  error  constants  are  so  small,  that  we  can’t  observe  the  reduction  in 
the  convergence  order,  due  to  the  integration  schemes,  using  double  precision  arithmetic.  From  this  we 
conclude  that  1  =  8  and  L  =  10  should  not  result  in  smaller  errors  than  L  =  6,  if  we  use  single  or  double 
precision  arithmetic. 

5.  Numerical  Validation 

For  numerical  validation  of  the  new  integration  schemes,  we  apply  them  to  telegraph  equations  (1) 
with  c  =  1.  As  boundary  conditions  we  have  to  choose  something  that  does  not  disturb  the  homogeneous 
scheme  for  infinite  spatial  extension.  Possible  choices  are  periodic  boundary  conditions  and  ideal  electric 
or  magnetic  walls  using  the  mirror  principle.  Our  test  line  is  defined  on  the  interval  [0, 1].  For  this 
problem,  the  analytic  solution  is  known.  For  tn  =  2 n,n  €  IN,  voltage  and  current  must  equal  the  initial 
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Figure  2.  k, Ax-errors  in  dependence  on  the  discretization. 
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N 
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condition  for  all  three  possible  boundary  conditions.  As  initial  condition,  we  set 


(8a) 

u0  =  < 

(8b) 

K 

io  =  0 

108(z  -  0.4)4(x  —  0.6) 
0 


4 


if  0.4^  x  <0.6 
else 


As  starting  values  for  the  multistep  schemes,  we  use  the  analytic  solution  to  avoid  additional  errors. 

Fig.  2  gives  the  ^2, Ax-errors  [7]  for  the  fourth  order,  (2,4)-,  (2,8),  and  (2,12)-B-spline  difference  operators 
for  all  five  integration  schemes. 

For  centred  differencing  (L  —  2),  we  have  a  convergence  order  around  3.  From  the  dispersion  analysis, 
we  would  expect  only  second  order.  The  reason  for  this  faster  convergence  is  not  clear. 

As  can  be  seen,  increasing  the  order  of  the  integrator  decreases  dramatically  the  error.  But  the 
convergence  order  is  limited  by  something  around  fourth  order.  The  reason  for  this  lies  in  the  regularity 
of  our  initial  condition.  Its  Fourier  transform  decreases  as  1/a;4.  This  means  even  though  we  have 
excellent  schemes,  if  we  don’t  excite  them  smoothly  enough,  we  get  a  worse  convergence  behaviour. 

If  we  choose  uq  to  be  smoother  (seven  times  continuously  differentiable) 


(9) 


uo  = 


1016(a;-0.4)8(s-0.6)8 

0 


if  0.4  <  x  s$0.6 
else 


we  can  achieve  higher  convergence  orders,  since  its  Fourier  transform  now  decreases  as  1/a;8;  see  fig.  3. 
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FIGURE  3.  Error  behaviour  for  the  smoother  initial  conditions. 
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Again,  the  reason  for  the  ’too  fast’  convergence  of  the  cases  L  =  2  and  L  =  4  is  not  clear.  The  others 
behave  like  predicted  from  the  dispersion  analysis  and  the  regularity  of  the  initial  value.  One  can  see 
also  very  nicely  the  influence  of  the  round-off  errors,  which  limit  the  achievable  accuracy  to  something 
around  a  hundred  times  the  relative  accuracy  of  double  precision. 

6.  Conclusion 

We  investigated  explicit  higher  order  multistep  methods  for  their  usefulness  with  telegraph  and 
Maxwell’s  equations.  We  found  Nystrom’s  methods  except  for  the  centred  differencing  case  L  -l 
to  be  useless.  Adams-Bashforth’s  methods  might  be  of  use,  if  one  accepts  a  (by  the  time  step)  tunable 
growth  or  damping.  Of  course,  this  complicates  the  choice  of  the  time  step.  , 

To  overcome  these  limitations,  we  constructed  a  new  class  of  multistep  schemes,  which  doesn  t  show 
any  amplification  or  damping  of  the  physical  modes.  This  new  family  includes  centred  differencing  as 
special  case.  Especially  the  cases  L  =  4  and  L  =  6  are  interesting,  since  they  behave  better  than  the 
Adams-Bashforth  pendants.  The  higher  order  cases  show  no  significant  improvement  with  respect  to  the 
Adams-Bashforth  cases,  since  the  practical  stability  limits  seem  to  be  comparable  but  the  new  family 
has  greater  memory  requirements. 

Furthermore,  we  showed  that  our  approach  leads  to  extremely  small  errors  with  moderate  computa¬ 
tional  effort  for  the  simple  telegraph  equations. 

The  application  of  multistep  methods  can  be  easily  extended  to  Maxwell’s  equations.  For  a  linear, 
homogeneous,  isotropic,  lossless  and  frequency-independent  medium  and  a  cubic,  non-graded  mesh,  we 
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axe  left  with  the  algorithm 
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Again,  m  denotes  the  time-step  and  l,  i,j  the  point  in  the  three-dimensional  spatial  grid.  The  first  partial 
derivatives  djdx,  djdy,  d/dz  are  approximated  by  the  difference  operators 


M2  X  A^2y  M2z 

D*=  £  Dv  =  £  «>«?■  D‘=  T.  r «S? 

n=—Mix  n——Miy  n=— M\t 

Sx,Sy,Sz  are  the  shift  operators  with  respect  to  the  2,  y,  2-axis,  e  is  the  dielectric  constant  and  ft  the 
magnetic  constant  for  the  medium. 

The  dispersion  relation  is  given  by 

L 


E  awe#" 


■  =  ±cAf 


E  @L,kejak 

k= 0 


m2x 


■Mi, 


M2  y 


53  P*6™*"  +  I  53  9ne^'^n  +  53  rn^Vzn 


n——Miy 


M2z 


n——Mu 


The  implementation  of  algorithms  of  this  type  is  on  its  way,  and  hopefully,  we  are  able  to  present  results 
soon. 
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Abstract.  In  this  paper,  the  generalized  theory  for  perfectly  matched  layers  (GT-PML) 
that  was  presented  as  a  systematic  development  of  the  unsplit-field  formulation  of  perfectly 
matched  layers  in  Cartesian  coordinates,  is  extended  to  investigate  the  absorbing  properties 
of  perfectly  matched  layers  in  cylindrical  and  spherical  coordinates.  Several  differences  are 
identified  between  PML  in  Cartesian  coordinates  and  PML  in  cylindrical  or  spherical 
coordinates.  It  is  concluded  that  no  PML  can  be  constructed  in  cylindrical  and  spherical 
coordinates  as  it  has  been  done  in  Cartesian  coordinates.  The  theory  presented  in  this 
paper  is  also  validated  by  FDTD  simulation  on  the  cylindrical  grid. 


1.  Introduction 

With  the  advent  of  Berenger’s  theory  of  perfectly  matched  layers  (PML)  [1]  and 
its  subsequent  variations  [2]- [4],  numerical  reflections  from  the  grid  truncation  boundary 
in  finite-difference  time-domain  (FDTD)  and  finite  element  (FE)  solutions  of  unbounded 
electromagnetic  problems  can  be  kept  at  levels  low  enough  to  facilitate  numerical  solutions 
of  high  accuracy.  The  most  successful  implementations  of  PMLs  using  either  Berenger’s 
original  split-field  formulation  or  the  unsplit-field  formulation,  has  been  demonstrated  in 
Cartesian  coordinates.  More  recently,  several  investigators  have  proposed  the  ways  for 
the  extension  of  the  original  PML  theory  to  cylindrical  and  spherical  coordinate  using  the 
mapping  technique  in  modified  Maxwell’s  system  and  the  split-field  components  [5]- [8]. 

In  this  paper,  a  systematic  derivation  of  the  anisotropic  absorbing  medium  in  orthog¬ 
onal  curvilinear  coordinates  is  presented,  based  on  the  Maxwellian  physical  system,  to 
construct  PML  in  orthogonal  coordinates  in  addition  to  Cartesian  coordinates.  The  devel¬ 
opment  begins  with  the  conjecture  that,  as  in  the  original  formulation  of  the  generalized 
theory  of  PML  (GT-PML)  in  Cartesian  coordinate  [4],  the  perfectly  matched  layers  can  be 
presented  by  the  anisotropic  permittivity  and  permeability  tensors  e  =  e[A]  and  ji  —  A] 
with  diagonal  elements  A,.  This  conjecture  is  motivated  by  the  tensor  that  describes  the 
anisotropic  matched  absorbers  in  Cartesian  coordinates  [4].  It  is  shown  that  this  form 
of  the  permittivity  and  permeability  tensors  can  be  translated  to  a  modified  definition 
of  the  V  operator  in  orthogonal  curvilinear  coordinates,  with  the  generalized  variables 
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Vi(i  =  1,2,3),  in  order  to  investigate  the  reflection  properties  of  propagating  wave  at  the 
interface  between  the  absorbing  layer  and  the  computational  domain,  and  construct  the 
reflectionless  absorbers.  Finally,  such  a  mathematical  framework  of  the  generalized  theory 
of  perfectly  matched  layers  leads  to  that  either  cylindrical  or  spherical  PMLs  do  not  behave 
exactly  like  the  ones  developed  for  Cartesian  grids,  and  a  reflectionless  interface  can  not 
be  affected.  This  result  is  validated  through  a  numerical  experiment  on  cylindrical  grids. 

2.  Maxwell’s  Equation  in  Orthogonal  Curvilinear  Coordinates 

With  the  assumption  of  a  time  dependence  of  the  form  e-,u'* ,  Maxwell’s  equations  in 
anisotropic  medium  have  the  form 


V  x  E  =  — joj/j[A]-H 

(la) 

V  x  H  =  jwe[A]-E 

(16) 

V-(e[A]-E)  =  0 

(1c) 

V-MA]-H)=0 

(Id) 

where  the  tensor  [A]  =  diag{Ai,  A2,  A3}  is  determined  in  orthogonal  curvilinear  coordi¬ 
nate  {ui,W2,  W3},  and  the  elements  of  the  diagonal  matrix  are  complex,  dimensionless  and 
constants.  For  example,  the  component  of  Maxwell’s  first  equation  and  the  divergence 
equation  for  the  electric  field  have  the  form 

(2a) 

hlLt^cXlhihkE>)= 0 

(2  b) 

To  determine  the  elements  of  the  tensor  of  PML,  we  introduce  the  scaling  of  physical 
electromagnetic  field 

E  =  [Q]E,  H  =  [Q]H  (3) 


where  the  matrix  [Q]  =  diag{qi,q2,qz}  is  a  coordinate-independent  constant  matrix.  The 
component  forms  of  Maxwell’s  curl  equations  with  the  scaling  components  of  electromag¬ 
netic  field  are  cast  by 

-  ik{  ikbh>qA)  ~ 

-*"*** = -  ££<*•**» 

~^3 = 


(4a) 

(46) 

(4c) 
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(5) 


Also,  the  divergence  equation  for  the  electric  field  becomes 


_ 1 _ v - i - ^-{hjhkEi)  =  0 

h\h2hz  Xj^kQjQk  oui 

Comparing  the  coefficients  of  the  derivative  items  in  (4)  suggests  the  following  expressions 
for  the  determination  of  the  scaling  constants  qi  in  terms  of  the  elements  of  tensor  matrix 


ii 

C?l  © 

1"< 

11 

c£l  O’ 

tfO  I  <N 

II 

©1  & 

(6) 

Defining  the  constants 

1  _  qk 
si  ^jQj 

(7) 

the  relations  of  (6)  yield 

Si  =  \A  j^k 

(8) 

Next,  after  introducing  the  variable  change  Vi  =  SjUj,  Maxwell’s  equations  with  scaling 
fields  may  be  written  in  orthogonal  curvilinear  coordinate  (vltv2,vz)  as  follows 

VQ  x  E  =  -jun H 

(9a) 

VsxH  =  juje  E 

(9  b) 

< 

ii 

o 

(9c) 

Va-  (/iH)  =  o 

(9  d) 

where 

3  ^ 

(10a) 

<1 

hrj> 

II 

(106) 

Equation  (9)-(10)  may  be  used  for  the  construction  of  the  anisotropic  PMLs  in  various 
orthogonal  coordinate  systems,  although  the  generalized  variables  may  be  adjusted  in 
terms  of  the  scale  factor  hi  of  the  special  coordinates  system. 


3.  PML  in  Cartesian  Coordinates 

For  a  plane  wave  in  Cartesian  coordinate  system  (hi  =  h2  =  hz  =  1),  the  general 
solution  of  (11)  is  the  familiar  expression 

E  =  E0e"ik‘v  (12a) 
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up H  =  -k  X  E  (126) 

where  k  =  kxex+kyey  +  kzez ,  ex,ej,,e2  axe  the  unit  vectors,  andv  =  vxex+vyey+vzex  = 
sxxex  +  syyey  +  szzez.  The  apparent  wavenumbers  along  x,  y,  z  direction  satisfy  the 
dispersion  equation 

k2x  +  kl  +  k\  -  k2  =  u2ep  (13) 

where  kx  =  ksinOcos<f>,  ky  —  ksindsin<f>,  kz  =  kcs9 ,  and  fc  =  u^fpi.  Without  loss  of  gener¬ 
ality,  the  planar  interface  between  the  computational  domain  and  anisotropic  medium  is 
taken  to  coincide  with  the  z  =  0  plane.  The  medium  1  (z  <  0)  is  filled  with  a  homogeneous 
medium  with  tensors  ei[Ai],  /zi[Ai],  where  [Ai]  =  diag{X\x,  Aiy,  Ai*},  and  corresponding 
six,  sij,  and  s\z  values  given  by  (8).  The  medium  2  [z  >  0)  is  filled  with  a  homogeneous 
medium  with  tensors  €2[A2],  ^2^2],  where  [A2]  =  diag{X2x,  X2y,  A2z},  and  corresponding 
s2x,  s2y  and  s2z  values  given  by  (8).  A  plane  wave  propagating  from  medium  1  to  ward 
medium  2  is  assumed  to  be  obliquely  incident  on  the  interface.  Its  polarization  is  assumed 
to  be  arbitrary,  and  the  standard  decomposition  of  the  field  in  its  transverse  electric  to  z 
{TEZ)  and  transverse  magnetic  to  2  ( TMZ )  parts  is  used  for  the  purposes  of  investigating 
the  reflection  properties  of  the  interface.  Following  the  procedure  in  [4],  it  can  be  shown 
that  the  interface  is  reflectionless  provided  that  =  €2, 1^1  —  fJ-2,  $1*  =  s2xi  Sly  =  $2 y,  and 
s2z  =  1  -f  s"/ju}.  On  the  basis  of  those  results,  the  elements  of  the  tensor  for  medium  2 
are  constructed  in  such  a  manner  that  the  propagating  wave  is  attenuated  in  the  medium. 
The  details  for  such  a  construction  are  given  in  [4]. 


4.  PML  in  Cylindrical  Coordinates 


As  we  mentioned  in  the  Introduction,  the  cylindrical  PMLs  have  been  proposed  by  sev¬ 
eral  groups  [5]- [8].  However,  in  contrast  to  the  case  of  cartesian  coordinates,  the  numerical 
performance  of  cylindrical  PMLs  is  not  well  documented.  We  approach  the  development 
of  PMLs  in  cylindrical  coordinates  in  a  manner  that  is  different  from  the  ones  used  in  the 
aforementioned  references. 

We  define  the  anisotropic  tensor  [A]  =  diag{Xp,  Az},  and  introduce  a  set  of  scaled 
fields  through  the  relations  in  (3),  where  [£J]  =  diag{qPfq<j,,qz}.  With  the  scale  factor  of 
cylindrical  coordinates,  hp  =  1,  h$  =  p  and  hz  —  1,  we  seek  the  relationship  between  the 
scaling  coefficients  and  the  elements  of  the  anisotropic  tensor  that  allow  the  Maxwellian 
system  of  (1)  to  be  cast  in  a  generalized  variable  form  that  is  consistent  with  cylindrical 
coordinates.  The  development  is  straightforward  and  leads  to  the  following  results: 
a)  Relationship  between  q’s  and  A’s: 


b)  Generalized  coordinate  variables: 


p  =  zp  =  s'pp, 


z  —  yj  XpXpz  —  &zz 


(14) 


(15) 
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The  remaining  two  Maxwell’s  equations  are  obtained  easily  from  (16)  using  duality. 

The  reflection  properties  of  a  cylindrical  interface  between  a  homogeneous  medium 
and  a  PML  can  be  affected  by  considering  separately  the  TE*  and  the  TM2  cases.  In  view 
of  (16),  the  general  solution  for  the  z  components  of  the  scaled  electric  field,  Ez,  and  the 
scaled  magnetic  field,  Hz,  for  the  TMZ  and  TEZ  case,  respectively,  are  of  the  form 

=  A0'n  Hn'2){kpP)cos[n$)e~3kzZ  (17a) 

p  n 

Hz  =  5Z  S  BP^Hn’2KkpP)sin(n^)e~iklZ 

p  n 

where  Hn  \  Hn  J  axe  the  nth-order  Hankel  functions  of  the  first  and  second  kind,  respec¬ 
tively,  while  k2+k2  =  k2. 

For  a  truncation  boundary  along  an  interface  perpendicular  to  the  2  axis,  it  is  apparent 
from  (17a, b)  that  continuity  of  the  tangential  fields  along  the  interface,  requires  continuity 
of  the  tangential  generalized  coordinates  along  the  interface.  Hence,  $p  =  =  1.  In 

addition,  in  order  to  effect  attenuation  inside  PML,  the  scaling  coefficient  along  z  must  be 
chosen  as  s’z  —  w  =  1  +  w" /jus.  From  (16)  it  is  immediately  clear  that  these  conditions 
axe  easily  satisfied  for  the  following  choice  of  the  tensor  [A]  inside  the  PML 

[A]  =  diag{w,w,  — }  (18) 

For  a  PML  perpendicular  to  the  ^-direction,  continuity  of  the  tangential  components 
of  the  scaled  fields  along  the  interface  requires  continuity  of  the  generalized  coordinates, 
4>,  z,  which  implies  sj  =  s'z  =  1.  In  addition,  the  choice  s'p  =  w  must  be  made  in  order 
to  effect  attenuation  inside  the  PML.  In  view  of  (16),  these  choices  lead  to  the  following 
form  for  the  anisotropic  tensor  inside  the  PML 

[A]  =  diag{l,l,w2}  (19) 
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and  the  scaling  factors  become,  qp  —  =  1,  qz  —  1/w. 

The  last  result  implies  that  even  though  the  scaled  z  components  of  the  fields  will  be 
continuous  at  the  interface,  the  corresponding  physical  components  will  not  be  continuous 
unless  w  =  1.  In  other  words,  unless  w  is  allowed  to  increase  from  a  value  of  1  at  the 
interface  in  a  continuous  fashion  inside  the  PML,  a  reftectionless  interface  cannot  be  af¬ 
fected.  However,  such  a  continuous,  gradual  growth  of  w  can  not  be  obtained  because  of 
the  discrete  implementation  of  PML.  Clearly,  this  is  in  direct  contrast  with  the  result  in 
cartesian  coordinates  where  PMLs  could  be  constructed  with  any  complex  constant  value 
of  w  =  1  +  w"  (ju)  other  than  1. 

In  order  to  confirm  this  theoretical  result  for  cylindrical  PML,  an  unsplit-field  FDTD 
implementation  was  attempted  on  a  two-dimensional  cylindrical  grid  to  validate  the  pos¬ 
sibility  that  PML  can  be  realized  The  grid  was  constructed  with  increments  along  the 
p  and  4>  of  A p  =  1.5cm  and  A<f>  =  4°,  respectively.  A  ^-directed  infinite  line  source  at 
point  (50Ap,45A<£)  inside  a  (70Ap  x  90A<£)-cell  domain,  SlN,  was  excited  by  the  smooth 
compact  pulse  of  [9].  The  computational  domain  was  truncated  with  a  16-cell  PML  backed 
by  a  cylindrical  perfectly  conducting  boundary.  The  benchmark  FDTD  solution  with  zero 
truncation  boundary  reflection  was  obtained  by  simulating  the  radiation  by  the  aforemen¬ 
tioned  line  source  in  a  much  larger  domain  fix,,  of. outer  radius  p  =  400 A/?,  terminated 
by  a  perfectly  conducting  circular  boundary.  The  time-step  for  the  FDTD  simulation  was 
1.57ps.  Fig.  1  depicts  the  global  error  over  the  domain  fijv-  Despite  the  rather  long,  16-cell 
PML,  the  global  error  increases  slowly  with  time,  indicating  the  presence  of  reflection  from 
the  cylindrical  PML. 


5.  PML  in  Spherical  Coordinates 


Following  the  above  procedure,  We  define  the  anisotropic  tensor  [A]  =  diag{Xp,  A$,  A^} 
as  well  a  scaling  matrix  of  fields  [Q]  =  diag{qp,  qe,  q<i>}.  With  the  scale  factor  of  cylindrical 
coordinates,  hp  =  1,  h9  —  p  and  h<t>  =  psind,  the  relationship  between  the  scaling  coeffi¬ 
cients  and  the  elements  of  the  anisotropic  tensor  is  determined  and  the  derivation  leads  to 
the  following  results: 
a)  Relationship  between  q's  and  A’s: 


A*  fSL\2  =  ^t 
\q$J  VW  ^ ' 


b)  Generalized  coordinate  variables: 


p  =  y/\9\4 ,p  =  S'pp , 


(20) 


(21) 


Also,  for  a  PML  perpendicular  to  the  /^-direction,  the  continuity  of  the  tangential  compo¬ 
nents  of  the  fields  along  the  interface  requires  s’e  —  =  1.  In  addition,  s’p  should  be  a 

complex  constant  to  effect  attenuation  inside  the  PML.  These  choices  lead  to  the  following 
form  for  the  anisotropic  tensor  inside  the  PML 


[A]  =  diag{w ,  w,  tt?} 


(22) 
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and  the  scaling  factors  become,  qp  =  qe  -  q<t>  =  1- 

In  terms  of  the  generalized  variables  defined  the  above,  Faraday’s  law  and  Gauss’  law, 
(9)-(10),  for  the  electric  field  have  the  form 


l.dEj 

p[  d6 


^  JEjQ  ~ 

+  EfiCote  -  -££■)  = 


(23  a) 


1  dE0  dEs  E, 


psind  d<fi  dp 


(23  6) 


{23c) 

— '■-[V-tfsineE,)  +  ^(psinOEe)  +  P& *)]  =  0  (23  d) 

Clearly,  such  a  medium  is  the  same  as  the  usual  isotropic  lossy  medium.  Although  some 
PML  requirements  are  satisfied,  it  is  impossible  to  realize  the  reflectionless  PML  in  this 
kind  of  lossy  medium,  since  the  factor  aj,  in  the  expression  of  fields  leads  to  the  reflection 
of  propagating  wave  at  the  interface. 


6.  Conclusions 

In  conclusion,  the  generalized  theory  of  the  Perfectly  Matched  Layers  has  been  used  to 
investigate  the  feasibility  of  constructing  perfectly  matched  layers  in  orthogonal  curvilinear 
coordinates.  The  performance  of  perfectly  matched  layers  in  both  cylindrical  and  spherical 
coordinates  were  examined  using  the  mathematical  framework  of  the  generalized  theory 
of  perfectly  matched  layers.  It  was  found  that  cylindrical  and  spherical  perfectly  matched 
layers  do  not  behave  exactly  like  the  ones  developed  for  cartesian  grids.  More  specifically, 
a  reflectionless  interface  cannot  be  affected  in  such  two  coordinate  systems  unless  the 
perfectly  matched  layer  properties  are  introduced  in  a  smooth  continuous  fashion,  so  that 
a  smooth  transition  is  achieved  from  the  properties  of  the  media  inside  the  computational 
domain  to  those  of  an  absorbing  layer.  This  result  was  validated  through  a  numerical 
experiment  in  cylindrical  coordinates. 
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Abstract 

This  study  discusses  the  effect  of  the  binding  material  permittivity  on  the  input  impedance  of 
contrawound  toroidal  helical  antennas  (CTHAs).  Using  the  finite  difference  time  domain  method 
(fdtd),  two  types  of  CTHAs  are  studied,  the  core  CTHA  and  the  printed  circuit  board  CTHA  (PCB- 
CTHA).  The  results  are  compared  to  a  general  theoretical  analysis  of  the  effect  of  permittivity  on 
input  impedance.  The  study  shows  that  the  effect  of  permittivity  on  the  input  impedance  for  core 
CTHAs  follows  very  closely  the  predicted  effect,  while  the  effect  on  PCB-CTHAs  is  slightly  different 
from  the  predicted  effect.  The  difference  can  be  attributed  to  discontinuities  in  the  fringe  fields  as 
evidenced  by  the  ideal  behavior  observed  when  covering  the  CTHA  in  a  block  of  the  binder  material. 

Introduction 

The  Contrawound  Toroidal  Helical  Antenna  (CTHA)  is  a  patented  antenna  developed  at  the 
Center  for  Industrial  Research  Applications  (CIRA)  of  West  Virginia  University.  The  antenna 
essentially  consists  of  two  toroidal  windings  of  wire,  with  opposite  pitch,  supported  by  a  binding 
material.  The  binding  material,  in  addition  to  holding  the  antenna  together,  also  plays  a  significant 
role  in  the  characteristics  of  the  antenna,  namely  affecting  its  input  impedance  and  gain.  The 
parameter  of  interest  is  the  permittivity  of  the  binding  material.  In  this  paper,  the  theoretically 
predicted  effect  is  compared  to  the  simulation  results  for  two  main  types  of  CTHAs,  the  toroidal  core 
CTHA  and  the  printed  circuit  board  CTHA  (PCB-CIHA). 

The  core  CTHA,  shown  in  Figure  1,  consists  of  two  helical  windings  of  wire  supported  upon  a 
central  core.  The  binding  material  of  the  core  CTHA  is  a  full  block  of  material.  Figure  1  shows  three 
instances  of  the  core  CTHA.  The  first,  for  clarity,  is  just  the  wire  windings  with  no  support  material. 
The  second  shows  the  CTHA  with  full  core  material,  and  the  third  is  for  a  core  CTHA  with  a  hollow 
center.  All  analysis  will  be  conducted  on  the  full  block  core  CTHA. 

The  PCB-CTHA,  on  the  other  hand,  as  is  clear  from  its  name,  has  the  two  wire  windings  in  the 
shape  of  traces  upon  a  printed  circuit  board  (PCB)  and  holes  filled  with  conducting  material  drilled 
through  the  PCB.  Figure  2  shows  two  instances  of  the  PCB-CTHA;  the  first  shows  the  traces  alone 
with  no  binding  material,  while  the  second  shows  the  complete  PCB-CTHA. 
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Figure  1  The  core  CTHA:  a)  wire  only,  b)full  core,  c)  hollow  core 


Figure  2  The  PCB-CTHA:  a)  wire  only,  b)  complete  CTHA 
Resonance  and  Dielectric 

A  small  antenna  such  as  the  CTHA  can  often  be  viewed  as  a  lumped  circuit.  Resonance  occurs 
when  the  reactive  components  of  the  impedance  cancel  at  the  particular  resonance  frequency.  For 
simplicity,  the  CTHA  can  be  considered  as  a  lumped  RLC  circuit.  For  lumped  parallel  and  series  RLC 
circuits,  this  frequency  is  given  by  Equation  1, 

2'x'f-=/jUc'  (,) 


where  L  is  the  circuit  inductance  and  C  is  the  circuit  capacitance.  Since  the  dielectric  constant 
primarily  affects  electric  field  storage,  it  will  change  the  capacitance  of  the  circuit. 

The  influence  of  the  dielectric  on  the  capacitance  can  be  seen  if  one  considers  Equation  2, 
which  gives  the  capacitance  of  a  perfect  (no  fringe  fields)  parallel  plate  capacitor: 
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where  A  is  the  plate  area,  d  is  the  seperation  distance  between  the  plates,  and  e  is  the  dielectric 
constant. 

If  Equation  2  is  substituted  into  Equation  1,  then  the  relationship  of  the  resonance  frequency  to 
the  dielectric  becomes  apparent.  The  resonance  frequency  is  related  to  the  dielectric  constant  via  an 
inverse  square;  if  two  constants,  a  and  b,  are  allowed  to  account  for  the  fringe  fields  of  a  particular 
CTHA  geometry,  a  similar  relation  can  be  expressed  as  in  Equation  3: 

f res  .dielectric  ~  +  ^  ^ res  *  ^ 


From  Equation  3,  the  effect  of  the  dielectric  can  thus  be  defined  as  follows 


(4) 


Numerical  Analysis 

The  numerical  code  used  for  this  analysis  was  REMCOM’s  XFDTD  4.03.  This  code  uses  the 
finite  difference  time  domain  method  for  electromagnetic  calculations.  The  method  is  a  transient 
marching-in-time  approach,  in  which  time  is  divided  into  small  discrete  steps,  and  the  electric  and 
magnetic  fields  on  a  fine  grid  are  calculated  at  each  time  step.  For  more  details  on  this  method  refer  to 
[1]. 

Core  CTHA 

A  typical  core  CTHA  was  modeled  using  XFDTD.  The  parameters  for  this  CTHA  are  shown 
in  Table  1 .  The  relative  permittivity  er  of  the  model  was  then  varied  between  1  and  5  in  steps  of  0.4. 
In  addition  to  the  cases  of  e^2  and  £,=4,  13  simulations  were  conducted.  The  first  four  resonant 
frequencies  for  all  these  simulations  were  then  found  and  grouped.  The  effect  of  the  permittivity  was 
then  calculated  as  shown  in  Equation  4.  Table  2  shows  the  effect  of  the  permittivity  on  the  first  four 
resonance  frequencies,  the  average  effect  for  each  permittivity  value,  and  the  ideal  effect  for  the  case 
of  <3=1  and  b= 0  in  Equation  3. 


Table  1  Parameters  of  the  core  CTHA 


i  Parameters  of  the  core  CTHA 

Major  Radius 

26.67  mm 

Feed  Type 

6 

Minor  Radius 

4.70  mm 

Feed  Position 

Polar 

Wire  Radius 

0.35  mm 

Wire  Material 

PFC 

Wraps 

10 

Cell  Size 

0.55  mm 
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Table  2  The  effect  of  permittivity  on  the  first  four  resonance  frequencies  for  the  core  CTHA 


Er-1.0 

£f=1.4 

£,-1.8 

£,-2.0 

E^2.2 

£,=2.6 

Er=3.0 

Er-3.4 

£,=3.8 

£r=4.0 

£,-4.2 

Er-4.6 

£r=5.0 

F*  (GHz) 

eff 

eff 

eff 

eff 

eff 

eff 

eff 

eff 

eff 

eff 

eff 

eff 

i 

0.323 

0.851 

0.752 

0.718 

0.685 

0.627 

0.596 

0.566 

0.530 

0.509 

0.500 

0.476 

0.450 

2 

0.620 

0.853 

0.757 

0.721 

0.688 

0.636 

0.593 

0.558 

0.529 

0.511 

0.498 

0.477 

0.458 

0.853 

0.760 

0.722 

0.688 

0.636 

0.595 

0.557  j 

;  0.527 

0.513 

0.503 

0.480  j 

0.460 

jEB 

1.253 

0.853 

0.757 

j  0.720 

0.688 

0.635 

0.593 

0.558  j 

0.529 

0.513 

0.501 

0.479 

0.460 

- 

0.853 

0.757 

0.720 

0.687 

0.634 

0.594 

0.560  | 

0.529 

0.512 

BSI 

0.478 

0.457 

- 

0.845 

0.745 

0.707 

0.674 

0.620 

0.577 

0.542  | 

0.513 

0.500 

0.466 

0.447 

Using  regression,  the  a  and  b  factors  of  Equation  3  were  then  found  for  all  five  sets  of  data  (the 
four  resonant  frequencies  and  the  average).  The  factors  for  the  sixth  set  of  data,  the  no  fringe  fields 
case,  are  1  and  0  by  definition.  Table  3  shows  the  a  and  b  factors  for  all  six  equations.  These  six 
equations  were  then  plotted  in  Figure  3. 

Table  3  Factors  obtained  from  the  six  sets  of  data  for  the  core  CTHA 


a 

b  1 

resl 

WEES3M 

res2 

22H 

0.979 

0.025 

ave 

0.981 

0.024 

theory 

1.000 

0.000 
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PCB-CTHA 

Table  4  shows  the  parameters  of  the  first  PCB-CTHA  modeled  for  this  study.  The  same 
procedure  described  above  was  followed.  Table  5  shows  the  effect  of  er  on  the  first  5  resonant 
frequencies,  and  Table  6  shows  the  factors  obtained  from  the  seven  sets  of  data.  Figure  4  plots  the 
seven  equations  together. 

Table  4  Parameters  of  the  PCB-CTHA 


Parameters  of  the  PCB-CTHA 


Length,  Width 

30.30  mm 

Feed  Type 

6 

Height 

3.23  mm 

Feed  Position 

Top  plane 

Wraps 

8 

Wire  Material 

PFC 

Table  5  The  effect  of  permittivity  on  the  first  five  resonance  frequencies  for  the  PCB-CTHA 


£r=1 

£r=1 .5 

ro 

8^2.5 

£r=3 

8r=3.5 

£r=4 

£r=5 

1*222 

eff 

eff 

eff 

eff 

eff 

eff 

eff 

■■ 

0.760 

0.740 

0.668 

0.617 

0.574 

0.481 

2 

IBESH 

2*2131 

0.740 

0.668 

0.615 

0.573 

0.484 

3 

0.744 

0.675 

0.485 

4 

0.843 

0.672 

5 

0.845 

0.678 

0.623 

0.581 

BIB 

0.842 

0.743 

0.672 

0.619 

0.576 

0.541 

0.485 

1 

0.816 

0.707 

0.632 

0.577 

0.535 

0.500 

0.447 

effect  of  P  on  a  regular  PCB-CTHA 


j  — resl  eq 
•~»~res2eq 


j  -A— i*83  eq 


I  res4  eq 

i 

|  res5  eq 

aveeq 
:  — > — theory 


1  1.5  2  2.5  3  3.5  4  4.5  5 

permittivity 


Figure  4  The  effect  ofez  on  the  input  impedance  of  the  PCB-CTHA 
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Table  6  The  factors  obtained  from  the  seven  sets  of  data  for  the  PCB-CTHA 


a 

b 

rest 

0.937 

0.072 

res2 

0.935 

0.073 

res3 

0.929 

0.079 

res4 

0.929 

0.079 

res5 

0.923 

0.087 

Ave 

0.931 

0.078 

theory 

1 

0 

Other  PCB-CTHAs 

Table  6  and  Figure  4  above  show  that  the  resulting  equation  for  the  effect  of  er  on  the  previous 
PCB-CTHA  is  fairly  far  from  the  ideal  case,  where  a  and  b  have  an  average  of  0.931  and  0.078 
respectively.  To  check  on  this  result,  three  other  PCB-CTHAs  were  modeled,  and  the  same  previous 
analysis  was  conducted  on  them.  As  shown  in  Table  7,  the  average  factors  obtained  for  these  three 
other  PCB-CTHAs  gave  the  similar  results  as  those  of  the  previous  case.  Figure  5  plots  the  average 
for  all  four  PCB-CTHAs  and  the  ideal  case  together. 

Table  7  The  average  factors  a  and  b  for  four  PCB-CTHAs 


a 

b 

Geo2 

0.932 

0.077 

Geo3 

0.962 

0.051 

Geo4 

0.937 

0.063 

theory 

1 

0 

Ave  effect  of  P  on  4  PCB-CTHAs 


-♦-Geol 
~»~Geo2  | 
Geo3 

. G  eo4 

Ideal 


Figure  5  The  average  effect  of£t  on  4  PCB-CTHAs  compared  to  the  ideal  effect 
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Covered  PCB-CTHA 

The  difference  obtained  for  the  4  PCB-CTHAs  was  suspected  to  be  caused  by  the  fringe  fields 
of  this  CTHA  geometry.  To  verify  that,  the  first  PCB-CTHA  modeled  above  was  remodeled,  but  this 
time  the  whole  CTHA  was  covered  in  a  thick  layer  of  dielectric  material.  In  this  case,  most  of  the 
fringe  fields  for  the  CTHA  will  fall  within  the  binding  material.  Table  8  shows  the  factors  obtained 
from  the  first  five  resonant  frequencies  of  this  CTHA,  in  addition  to  the  average  and  ideal  factors, 
while  Figure  6  plots  these  7  equations  together. 

It  is  clear  that  for  this  geometry  the  effect  of  £r  on  the  resonant  frequencies  was  almost  identical 
to  that  predicted  by  the  perfect  case  of  Equation  3. 

Table  8  The  factors  obtained  from  the  seven  sets  of  data  for  the  covered  PCB-CTHA 


a 

b 

resl 

0.990 

0.008 

res2 

0.995 

0.005 

res3 

1.002 

0.000 

res4 

0.995 

0.005 

res5 

1.003 

0.000 

Ave 

0.997 

0.003 

theory 

1 

0 

Figure  6  The  effect  ofZt  on  the  input  impedance  of  the  covered  PCB-CTHA 
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Conclusion 

Theory  suggests  a  relationship  between  resonant  frequency  and  the  relative  permittivity  of  a 
binder  material  for  a  CTHA.  Modeling  verifies  this  relationship  for  toroidal  core  CTHAs  and  is 
closely,  but  not  exactly,  followed  for  PCB-CTHAs.  The  theory  that  this  deviance  may  be  attributed  to 
fringe  fields  traversing  the  air/dielectric  boundary  is  supported  by  the  exact  match  of  theory  and 
simulation  when  a  PCB-CTHA  is  covered  by  the  dielectric  (i.e.  no  air/dielectric  boundary  within  the 
near  field). 

The  average  factors  a  and  b  of  Equation  3  for  the  core  CTHA  and  the  covered  PCB-CTHA 
highly  agree  with  the  perfect  case  of  equation  3.  In  both  cases  a  was  approximately  1  while  b  was 
approximately  0.  On  the  other  hand,  all  the  regular  PCB-CTHAs  modeled  showed  a  slight  shift  from 
the  ideal  case.  For  all  4  PCB-CTHAs  modeled,  a  was  approximately  0.93  while  b  was  approximately 
0.07. 


These  results  provide  a  precise  tool  for  predicting  the  input  impedance  and  resonant 
frequencies  for  CTHAs  with  only  the  knowledge  of  the  input  impedance  at  one  known  value  of  er 
required.  It  must  be  noted  that  it  is  assumed  that  er  is  constant  with  respect  to  frequency  and  that  the 
dielectric  is  a  simple  isotropic  material.  This  study  also  shows  that  the  fringe  fields  of  the  PCB-CTHA 
have  a  clear  effect  on  its  input  impedance,  contrary  to  the  core  CTHA,  where  this  effect  is  negligible. 
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Nomenclature 

e 

absolute  permittivity 

£r 

relative  permittivity 

a,  b 

regression  coefficients 

C 

capacitance 

CTHA 

Contrawound  Toroidal  Helical  Antenna 

eff 

effect  of  the  dielectric  on  the  resonance  frequency 

f«s 

resonance  frequency 

fres  dielectric 

resonance  frequency  for  a  specific  dielectric 

L 

inductance 

PCB-CTHA 

Printed  Circuit  Board  CTHA 
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Abstract 

The  numerical  dispersion  relation  of  a  2-D  time-domain  Haar  wavelet  modeling  technique  has  been  examined 
by  simple  but  precise  numerical  experiments,  and  the  results  were  compared  with  analytical  results.  It  was  found 
that,  under  the  presence  of  boundary  conditions,  the  numerical  dispersion  is  not  in  agreement  with  those  obtained 
with  analytical  formulae;  it  is  better  than  that  of  the  FDTD  method  when  the  discretization  is  coarse,  and  it 
asymptotically  converges  to  that  of  FDTD  as  the  discretization  becomes  finer. 

I.  Introduction 

Space-discrete  analysis  techniques  intrinsically  have  a  numerical  dispersion  error  due  to  the 
finite-interval  sampling  of  the  continuous  space.  For  those  techniques,  analysis  of  the  numerical 
dispersion  relation  is  important  because  it  provides  a  clear  insight  into  the  accuracy  and  the 
efficiency  of  the  numerical  technique. 

For  the  Haar- wavelet  based  multiresolution  time-domain  (Haar-MRTD)  technique  [1],  [2],  the 
analytical  formula  for  the  numerical  dispersion  has  been  reported  in  [3].  However,  it  has  not  been 
verified  by  precise  numerical  experiments.  Moreover,  since  the  Haar-MRTD  technique  employs 
boundary  conditions  that  enforce  the  basis  functions  coupled  at  the  boundaries,  it  is  predicted 
that  the  numerical  dispersion  behaves  differently  form  the  analytical  formulae  under  the  presence 
of  the  boundary  conditions. 

In  this  paper,  we  investigate  the  numerical  dispersion  relation  of  the  Haar-MRTD  technique  by 
means  of  numerical  experiments  and  compare  the  results  with  the  analytical  formulae  available 
in  the  literature. 

For  the  numerical  experiments,  rectangular  waveguide  cavity  resonators  with  various  aspect 
ratios  were  used  to  realize  oblique  angle  propagation  of  plane  waves.  This  allows  us  to  obtain 
dispersion  relations  for  various  angles  of  propagation.  The  boundaries  were  all  perfect  electric 
conductor  (PEC)  conditions;  this  avoids  the  unknown  dispersion  error  of  absorbing  boundary 
conditions.  For  a  waveguide  with  a  fixed  aspect  ratio,  TEu  mode  oscillation  was  excited  and  the 
resonant  frequency  was  extracted  from  the  time  response  by  Fourier  transformation.  The  error  of 
the  resonant  frequency  was  plotted  as  a  function  of  the  normalized  wavenumber. 
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II.  Time  Iterative  Difference  Equations 

Discretization  of  Maxwell’s  curl  equations  for  the  TE  case  involves  the  Ey,  Hx  and  Hz  com¬ 
ponents.  Using  Haar  scaling  and  wavelet  functions  yields  time  stepping  update  equations  of  the 
following  form  [2] 


Him  =  |Tu  +  iC5E®^-r«.)  (1) 

fjo,  =  iroi+«s(-jn,+r«)  (2) 

Woo  =  n  Woo  +  >31  {?» -»*-(»»  -&)-&}  (3) 


where  the  notations  for  voltage  currents  £/**,  ¥ik  and  current  source  yJik  are  defined  as  ^10 
=  ¥oh  =  n—i/z^ije+i/2'>  V¥oo  =  n+i/2*^fc>  and  so  Ono  for  the  wavelet  and  scaling  functions 

£,  C  =  <i>,  if,  and  for  the  time  and  space  indices  n,  i  and  k,  respectively.  The  coefficients  are  given 

by 


VS 

vs 

VL 

yQk 


At 

fAll 

Vih 

lA-zj 

At 

\Az] 

Vik 

LAmJ 

2cjjt  - 

2eik  +  ffflt  A  t 

2A  t  r  1  1 

2eik  +  crikAt  LAa:AzJifc 


(4) 

(5) 

(6) 
(7) 


The  equations  (l)-(3)  are  computed  independently  for  the  individual  basis  functions  ££  =  <fxf), 
<fyip,  ipi p  and  if  ip. 


III.  Numerical  Dispersion  Relation 

Numerical  techniques  that  involve  discrete  finite  difference  grids  inherently  have  a  numerical 
dispersion  error.  It  is  caused  by  the  finite  dimension  of  the  grid  and  cannot  be  ignored  when 
the  wavelength  is  comparable  to  the  unit  cell  size.  An  analytical  dispersion  relation  is  derived  by 
substituting  a  time-harmonic  trial  solution  in  the  finite  difference  form  of  Maxwell  equations  [4]. 
The  dispersion  error  is  a  function  of  the  number  of  cells  per  wavelength,  the  angle  of  propagation 
and  the  Courant  stability  factor  defined  by 

s  —  cAt/Al  (8) 

where  c  is  the  speed  of  light,  At  the  time  step  and  A l  the  spatial  grid  interval. 

A.  Analytical  formula 

The  analytical  dispersion  relation  for  the  conventional  FDTD  algorithm  in  the  case  of  two  space 
dimensions  is  [4] 


while  the  dispersion  relation  for  the  Haar-MRTD  technique  is  given  in  [3]  as 


These  analytical  dispersion  relations  are  shown  in  Fig.l  for  two  stability  factors  s  —  0.9/y/2  and 
OA/y/2  and  two  directions  of  propagation  a  =  45°  and  90°  (identical  to  0°).  The  grids  were  square 
At  —  Az  =  A l.  The  dispersion  relation  for  the  Haar-MRTD  technique  is  in  general  superior  to 
that  of  the  conventional  FDTD  method.  The  dispersion  relation  for  a  plane  wave  w  =  ck  leads 
to  the  ideal  dispersion  relation  for  a  certain  number  of  At  and  Al  as 

H  =  xs  ,  (11) 

where  Q,  (=  u  At)  and  X  (=  k  Al)  denote  the  normalized  angular  frequency  and  the  normalized 
wavenumber,  respectively,  and  s  the  stability  factor  (8). 


B.  Numerical  Experiments 

The  numerical  dispersion  relation  was  tested  by  analyzing  the  TEn  mode  resonance  in  rectan¬ 
gular  cavities. 

The  resonant  frequency  of  the  TEn  mode  in  a  rectangular  cavity  of  width  a  and  length  6  shown 


in  Fig.  2  is  given  by 

*“W(s),+(s) 

(12) 

and  the  wavenumber 

k=jki+ki=]](l)  +(:)  . 

(13) 

In  the  cavity  with  TEn  mode  oscillation, 
waves  propagating  at  the  angle  of 


the  field  can  be  considered  as  the  superposition  of  plane 
6  =  arctan  -  (14) 


a 

with  respect  to  the  main  coordinate  axis.  Therefore,  by  testing  resonances  in  various  aspect 
ratios  of  rectangular  cavities,  one  can  experimentally  obtain  the  numerical  dispersion  relation 
as  a  function  of  the  angle  of  propagation.  The  testing  procedures  were  as  follows:  (i)  fix  the 
propagation  constant  k  by  (13),  and  the  angle  of  propagation  6  by  (14),  then  (ii)  perform  numerical 
analysis  and  obtain  resonant  frequency  fT  from  the  time  response  by  Fourier  transformation,  and 
finally,  (iii)  plot  the  frequency  error  (ft  -  xs)/(x$)  (^  =  27r/rAt)  as  a  function  of  the  normalized 
wavenumber  x  (=  kAl). 

The  angle  of  propagation  of  45°  (a  =  1,6  =  1),  63°  (a  =  1,6  =  2),  78°  (a  =  1,6  =  5)  and 
84°  (a  =  1, 6  =  10)  were  investigated  and  compared  to  the  analytical  solution  of  the  dispersion 
equations  in  Fig.  3.  Because  of  the  location  of  the  PEC  boundaries  that  are  shifted  by  quarter 
cell  size  from  the  grid  lines,  the  angles  of  propagation  cannot  be  exactly  identical  for  the  different 
discretizations;  they  have  a  tolerance  of  approximately  1%,  but  the  computational  results  are  not 
sensitive  to  this  tolerance. 
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2.5 


(a)  FDTD 


(b)  Haar-MRTD 

Fig.  1.  Analytical  dispersion  relations  for  (a)  2-D  FDTD  [4],  and  (b)  Haar-MRTD  techniques  [3].  Normalized 
angular  frequency  fl  =  uiAt  is  plotted  as  a  function  of  normalized  wavenumber  x  =  fcA l. 
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Fig.  2.  A  top  view  of  the  rectangular  cavity  resonator  for  the  numerical  experiments,  a  denotes  the  angle  of 
propagation,  and  k  the  propagation  constant  of  the  plane  wave.  Thin  lines  represent  Yee’s  grid  lines,  PEC 
walls  are  located  at  a  quarter  cell  size  away  from  the  grid  lines. 


It  is  shown  in  these  figures  that,  in  the  limit  of  infinitely  small  grids,  the  experimentally  obtained 
dispersion  curves  agree  with  those  of  the  analytical  dispersion  relation  of  the  FDTD  method,  while 
for  coarser  grids,  the  Haar-MRTD  technique  is  less  dispersive  than  the  conventional  FDTD. 

It  should  be  mentioned  here  that  the  analytical  dispersion  relation  of  (10)  shown  in  Fig.l  is 
not  in  agreement  with  the  dispersion  relations  obtained  experimentally  (shown  in  Fig.3).  The 
dispersion  of  the  Haar-MRTD  technique  should  obviously  follow  that  of  FDTD  in  the  limit  of  fine 
grids  when  the  normalized  wavenumber  tends  to  zero.  As  the  grids  become  coarser,  the  influ¬ 
ence  of  the  boundary  conditions  on  the  experimental  results  becomes  apparent.  For  coarse  grids 
(largest  normalized- wavenumber  in  the  plots),  the  dispersion  relation  changes  abruptly  because 
the  opposite  sides  of  the  PEC  boundaries  approach  and  cause  mutual  interaction. 

IV.  Conclusions 

The  numerical  dispersion  relation  of  the  time-domain  Haar-wavelet  modeling  technique  was 
investigated  with  simple  but  precise  numerical  experiments.  The  dispersion  relation  that  was 
obtained  by  numerically  analyzing  the  TEn  mode  in  rectangular  cavities  suggests  results  that  are 
similar  to  those  obtained  with  FDTD  method;  the  numerical  dispersion  of  the  Haar-MRTD  tech¬ 
nique  is  better  than  that  of  the  conventional  FDTD  method  when  coarser  grids  are  incorporated, 
and  it  asymptotically  converges  to  that  of  FDTD  as  the  discretization  becomes  finer. 
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Fig.  3.  Experimentally  obtained  numerical  dispersion  relations  for  2-D  Haar-MRTD  modeling.  Normalized 
frequency  error  (fi  —  xs)/0fs)  is  plotted  as  a  function  of  normalized  wavenumber  x  =  k&l. 
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Abstract: 

The  aim  of  this  study  is  to  develop  and  compare  a  Finite  Element  method  called  the  Discontinuous  Galerkin 
method  and  a  Finite  Volume  method,  solving  Maxwell  equations  in  heterogeneous  media.  The  comparisons  are 
based  on  the  CPU  time,  and  the  quality  of  the  solutions.  The  first  advantage  of  these  two  methods  is  then- 
flexibility  in  treating  complicated  geometries.  The  second  advantage  is  their  ability  to  capture  the  discontinuities 
of  the  electromagnetic  fields  through  different  media,  without  producing  spurious  oscillations.  We  describe  m  this 
paper  the  above  mentioned  methods  and  give  numerical  comparisons. 

1  Introduction: 

These  both  methods  are  well  suited  for  handling  complex  geometries,  therefore  we  use  unstructured  meshes  that 
conform  to  these  geometries.  Moreover,  they  make  use  of  the  volume  integral  form  of  the  Maxwell  equations, 
which  provides  a  very  natural  way  to  verify  the  transmission  conditions  at  the  material  mterfaces.  Owing  to  these 
properties,  we  can  treat  difficult  problems,  such  computing  a  scattered  wave  across  complex  objects. 

The  Maxwell  equations  are  given  by  : 

||  +  rot(E)=0 

,  "  rot(R)  =  "J  (1) 
div(D)  =  p 
<fiu(B)  =  0 


where  E  and  H  are  the  electric  and  magnetic  fields  respectively,  and  D  and  B  are  the  electric  and  magnetic 
inductions  respectively  .  J  is  the  current  density  and  p  is  the  charge  density.  The  electric  permittivity  e  =  e(z,  y,  z ), 
and  the  magnetic  permeability  p  =  p(x,y,  z)  are  piecewise  constant  functions.  In  the  linear  and  isotropic  case,  we 
have  the  following  relationships  : 

B  =  mH 
D  =  sE 


We  need  the  conservative  form  of  the  system  (1)  to  construct  the  numerical  schemes,  it  is  given  by  : 


^  +  divW(Q)  =  -J 


(2) 
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with  Q  =  ((B,D),  and  IF  =  (Fi,FltF3),  with 


0 

D-Je 

r  -Dy/e 

-Dz/e 

0 

Dx/e 

Dy/e 

0 

■>  F2(Q)=< 

1  l 
to  to 

•eT 'm’ 

f3(Q)  =  < 

0 

By/n 

Bs/n 

0 

~Bx/n 

.  -By  lil  j 

.  . 

0 

Because  of  the  truncated  scattered  wave  domain,  we  consider  an  artificial  boundary  which  requires  that  any 
outgoing  wave  disappear  without  reflection.  Thus,  we  consider  two  types  of  boundary  conditions: 

1- The  metallic  boundary  condition  given  by  n  A  E  =  0. 

2-  The  Silver-Muller  absorbing  condition  on  the  artificial  boundary  given  by: 
n  A  E  =  -\fyje.n  A  (ff  A  H) 


2  Development  of  the  numerical  schemes: 

2.1  Finite  Volume  Scheme: 

The  approach  consists  in  integrating  the  conservative  form  (2)  over  “control  volumes”.  We  consider  a  conforming 
unstructured  mesh  made  up  of  triangles  T*  with  centroids  G*.  We  use  as  control  volumes,  the  triangles  (figure  1). 
One  of  the  advantages  is  that  the  triangles  are  the  most  appropriate  cells  to  define  the  interfaces  between  different 
media,  and  to  take  into  account  the  current  sheets,  especially  in  the  heterogeneous  materials  case  ([3j,  [4j). 


Figure  1:  Control  volume 

Using  a  Green's  formula  and  an  approximation  of  the  interface  integrals,  yields  the  following  result: 

3 

|3il(Q)i  +  ^  +  boundary  terms  =  -Jj,  (3) 

jf=i 

$ij  =  $(Qi,Qj)  =s  J  IF  .nn 

dT'nsr, 

where  is  an  approximation  of  the  integral  over  the  interfaces,  depending  on  both  values  Q,  and  Qj.  The  first 
value  is  the  approximate  solution  into  the  interior  of  the  element,  and  the  second  one  is  the  approximate  solution 
outside  the  element,  mj  is  the  outward  unit  normal  to  dTi  n  dTj.  To  obtain  a  high  order  space  scheme,  we  use 
the  MUSCL  [5]  technique:  we  replace  the  arguments  of  the  flux  function  by  linear  reconstruction  functions  Ly 
given  by  the  following  equation  : 

Ly  =  Qi  +  (1  -2/?)(Qi  ~Q,)  +  2/?VQr,  .Sy,  0  €  [0, 1] 

Sjj  is  the  distance  between  the  mass  center  G;  of  T,  and  the  mid-point  of  the  edge  dTi  n  dTj.  We  have  to  evaluate 
the  discrete  gradient  VQ^  on  each  element  T,. 
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Finite  Volume  Gradient 

We  approach  the  gradient  by  the  following  formula  : 


vQr.  =  T}ij~  /  nii 

'  j=1  ~  dT'naT' 

Treatment  of  the  metallic  boundary  condition 

In  order  to  compute  a  value  of  a  numerical  flux  at  the  metallic  boundary,  we  use  a  local  exact  resolution  of  the 
system  taking  into  account  the  condition  n  A  E  =  0. 


Time  scheme 

In  order  to  obtain  a  third  order  accurate  scheme  in  time,  we  use  a  Runge-Kutta  scheme  with  three  steps. 

We  present  here  as  an  example,  the  computed  result  for  the  magnetic  field  of  an  heterogeneous  circular  dielectric 

cylinder  scatterer  (figure  2). 


Figure  2:  A  zoom  of  the  mesh  of  the  cylinder  at  left  and  the  scattered  magnetic  wave  across  a  dielectrical  material 

at  right. 
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2.2  Finite  Element  scheme  (Discontinuous  Galerkin  method): 

In  this  section,  we  extend  the  Discontinuous  Galerkin  method  [1]  to  solve  the  Maxwell  equations,  in  heterogeneous 
media.  This  method  is  based  on  the  choice  of  a  local  basis  functions  and  quadrature  rules.  Note  that  the  functions 
axe  discontinuous  in  the  space  of  discretization.  We  propose  to  construct  a  second  order  scheme,  therefore  we  use 
polynomials  of  degree  1  as  basis  functions  [2J.  In  order  to  verify  the  transmission  conditions  between  different 
media,  we  use  a  Godunov  scheme  to  compute  the  numerical  flux.  Let  the  solution  domain  D  be  divided  into  a  set 
of  triangles  T*.  The  solution  in  each  element  T,  will  be  possibly  discontinuous  across  element  boundaries. 

We  first  define  the  space  of  discretization  VJ,1  by: 

V;1  =  {v  6  V/vfa  €  P\*  =  1,  ...M) 


where 


y  =  i2(nx[0,oo]) 

P1  is  the  set  of  polynomials  of  degree  1.  To  construct  the  approximate  solution  Qh  £  V£, 
we  use  the  weak  solution  defined  as  follows  : 

TtSr.  Q(X,t)<pk(X)dTi  +  '£jeN{i)  JannOT,  IF(Q (X,t)).nijipk(X)d<r 
- ST,  W{Q{X,t))V<pk(X)dTi  =  -  fTi3{X,t)<pk(X)dT{ 

N(i)  denotes  the  number  of  neighbours  of  the  triangle  T,-,  riij  the  outward  unit  normal  to  the  edge  dTi  n  8Tj,  and 
tpk,k  =  1,  ..3  are  the  basis  functions,  each  ipk  takes  the  value  1  at  the  mid-point  mk  of  the  k-th  edge,  and  0  at  the 
others  mid-points  of  the  triangle’s  edges. 

We  first  approach  IF(Q(X,t)).n.fj  by  a  numerical  flux,  and  then  we  replace  each  integral  by  a  Gauss  quadrature 
rule  in  order  to  obtain  a  second  order  in  space  scheme.  After  inverting  the  mass  matrix,  which  is  diagonal  since 
the  basis  functions  are  orthogonal,  we  get  the  ODE  : 

■^Qh  =  lh(Qh,Qh) 

We  use  for  the  time  discretization,  a  Runge-Kutta  scheme  with  two  steps  given  by  : 

f  Q°  =  Qn  , 

<  Q lH  =  Ei=0  <*i&h  +  PuAtUiQlQKt  +  d'At))  1  =  1,2, 

(  Qr1  =  Q i 

is  the  boundary  condition,  The  parameters  are  given  by  : 


QUO  =  1 

a 20  =  t  | 

<*21  =  5 

0io  —  0 

020  =  0  ! 

021  =  7 

do  =  0 

|  di  =  l  | 

Table  1:  parameters  of  the  Runge-Kutta  scheme 

We  give  here  some  numerical  results  obtained  with  the  Discontinuous  Galerkin  method.  The  figure  (3)  is  a  scattered 
electric  wave  across  an  heterogeneous  cylinder  section,  and  the  figure  (4)  is  the  electric  field  produced  by  a  sheet 
of  current  over  the  cross  section  of  a  rectangular  waveguide. 
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Figure  3:  Scattered  electric  wave  across  a  dielectrical  material. 


Figure  4:  Electric  wave  in  a  rectangular  waveguide 


We  recall  that  the  aim  of  this  study  is  to  compare  the  two  methods.  We  present  in  this  section  two  numerical 
tests  The  first  test  consists  in  computing  the  evolution  of  a  mode  of  resonance  in  a  cavi  y.  an  we  giv 
of  computation  for  the  two  methods  (table  2).  We  also  give  the  comparison  of  the  energies  (figure  6).  We  use  two 
different  meshes  for  the  two  methods  in  order  to  have  the  same  number  of  degrees  o Jo  ^33 
resolution  for  ten  wavelengths  (figure  5).  The  numerical  experiments  are  run  on  a  DEC  machine  (Alpha  400  -33 

MHZ,  5  Specfp  95)  - 
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Figure  5:  Time  evolution  of  the  resonance  mode  (electric  field) 


method 

Finite  Volume 

Discontinuous  Galerkin 

CFL 

HSB 

0.2 

time/iteration  (s) 

i 

2.98 

global  time(s) 

70 

220 

Table  2:  Time  of  computation  (CPU). 


Figure  6:  A  comparison  of  energy 


(Figure  6)  shows  that  the  Discontinuous  Galerkin  method  diffuse  less  than  the  Finite  Volume  one,  however 
thev  both  tend  to  zero. 

The  second  test  consists  in  a  qualitative  analysis  in  an  heterogeneous  case.  It  represents  a  propagation  of  a  pulse 
into  a  material  composed  of  vacuum  at  right  of  x  =  0  and  a  glass  at  the  left  (figure  7).  We  notice  that  both 
methods  give  a  good  approximate  solution. 


Figure  7:  Pulse  propagation  into  an  heterogeneous  medium 


3  Conclusion 

We  have  presented  in  this  paper  two  methods  for  solving  the  Maxwell  system,  with  discontinuous  coefficients. 
We  can  notice  that  the  use  of  the  triangles  as  integration  volumes  facilitates  the  treatment  of  problems  with 
complex  geometries.  We  can  also  remark,  that  the  transmission  conditions  are  verified  in  the  heterogeneous  case, 
by  both  methods.  Numerical  results  are  shown,  and  we  deduce  that  the  Finite  Volume  method  gives  better  CPU 
performance  than  the  Discontinuous  Galerkin  one,  to  obtain  the  same  quality  results. 
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Introduction 

The  master  system  of  equations  of  collision-iess  plasma  physics  is  the  Vlasov-Maxwell  system.  The  main  result 
in  this  field  has  been  obtained  in  1989  by  R.J.DiPema  and  P.L.Lions  {2].  They  prove  existence  of  global  weak 
solutions  for  the  Cauchy  problem  with  arbitrary  data.  The  situation  is  much  better  for  the  Vlasov-Poisson  system. 
This  system  is  obtained  for  the  first  one  by  neglecting  the  magnetic  field.  However  for  applications  like  vacuum 
diodes,  tube  discharges,  cold  plasma,  solar  wind,  satellite  ionization,  thrusters,  etc...  boundary  conditions  have  to 
be  taken  into  account.  Periodic  regimes  are  particularly  important.  One  strong  motivation  to  study  such  solutions 
is  the  great  difficulty  to  compute  it  numerically.  In  this  paper  we  introduce  the  Limit  Absorption  Method  (LAM) 
which  has  been  developed  by  the  authors  in  [1]  to  obtain  numerical  periodic  solutions  of  PDE.  The  principle  of  this 
method  was  proved  on  a  reduced  ID  model.  We  apply  then  this  methodology  to  the  electro-magnetic  scattering 
problems  and  also  to  the  transport  of  charged  particles  in  electro-magnetic  field. 


1  A  reduced  ID  model 


In  this  section  we  study  the  convergence  to  the  periodic  regimes  for  the  solutions  of  an  ODE  with  a  periodic  source 
term.  The  main  tool  consist  of  a  new  controllability  method,  the  Limit  Absorption  Method  (LAM).  Consider  the 
model  ODE  : 

^  +  s  ■  x{t)  =  f(t)  ,t>  0  (1) 

with  the  initial  data  x(0)  =  xo,  where  /  is  a  T  periodic  function  in  time.  It  is  easy  to  see  that  the  exact  solution 
of  this  problem  is  given  by  : 

x(t)  =  x0  ■  e~€t  +  f  f(s)  ■  e~s(t~s)ds.  (2) 

Jo 

Thus  the  condition  of  periodicity  x(0)  =  x(T)  implies  : 


m-e-iT-)ds 

*°  = - - ' 

and  therefore,  the  periodic  solution  is  given  by  the  formula  : 


zoo (t)  =  J~°r/(f_ee.!r~5)"f  '  e"£‘  +  jf  /(fi)  ‘  e-£(t-s)<is. 


(3) 


(4) 


Now  it  is  clear  to  see  that  if  (3)  is  not  satisfied,  reaching  a  77-approximation  of  the  periodic  regime  x^,  requires  a 
time  : 


t**0{\ log(i)). 
£  77 


(5) 


For  small  values  of  the  parameter  s,  this  time  become  very  long,  which  implies  an  important  amount  of  calculus. 
In  order  to  accelerate  the  convergence  to  the  periodic  states,  an  absorption  term  A  •  x(t)  was  introduced  : 
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A  ■  a(t)  +  +  £  •  ac{t)  =  f{t)  ,  t  >  0. 


(6) 


This  time,  we  get  a  77-convergence  time  : 

t*  ~  °(-TT  lQS(^))  <  °(t  1os(~))-  (?) 

£  +  A  T)  A  77 

In  contrast  to  (5),  the  time  given  by  (7)  is  independent  in  respect  to  the  parameter  e.  However,  the  solution 
obtained  from  the  modified  equation  (6)  doesn’t  verify  (1).  The  idea  consist  of  replacing  the  absorption  parameter 
A  >  0  by  a  sequence  (A„)n>o  which  converges  to  0.  We  have  proved  the  following  proposition: 

Proposition  1  Let  (An)„>0  be  a  sequence  of  positive  real  numbers  which  verifies: 


(i)  limn_oo  A„  =  0, 

(ii)  E„>oA„  <  00, 

and.  loo  be  the  periodic  solution  (4)  of  the  problem  (1).  Then  the  equation: 

\{t)-x(t)  +  ^  +  e-x(t)  =  f(t)  >  t>  0, 


(8) 

(9) 


with  the  initial  condition: 


x(0)  =  xq, 


where  f  G  is  a  T  periodic  function  and  A (i)  =  An,  t  €  [nT,  (n  +  1)T[,  n  >  0  has  a  unique  solution  which 
converges  to  the  periodic  state  : 


lim  ||z(nT  +  •)  -  Soo(-)!Il«(o,t)  =  0. 


(10) 


We  want  to  apply  Proposition  1  for  a  particular  choice  of  (An)„>0.  We  have  the  following  proposition: 

Proposition  2  With  the  same  notations  as  in  Proposition  1,  we  assume  that  f  6  L°°  is  a  periodic  function  which 
verifies: 

f  f($)ds  =  0, 

J  0 

and  that  the  sequence  (An)n>o  is  given  by: 

A0  =  1, 

An+1  =  An,  if  |X„+1  -  Xn  |  >  V:  ™  >  0, 

An+1  =  A„e_T,  if  jxn+i  -Xn\<V,  n>0.  (11) 

Then,  the  convergence  time  to  a  77- approximation  of  the  periodic  state  is  given  by: 

f*«0(4=-log(^)).  (12) 

yfn  V 

Remark  1  The  time  evolution  of  the  parameter  A  can  be  given  also  by  an  ODE  : 

^  +  A (t)  ■  X[-v,ij](a:n  ~  3n-i)  =  0,  t  €  [1 iT,  (n  +  1)T(,  (13) 

where  X[-i7nl  *s  characteristic  function  o/(— 77,77]  and  we  get  the  same  result  as  in  Proposition  2. 
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Generally,  the  resolution  of  the  problem  (1)  is  performed  using  the  system  : 

dm 

\{t)  •  x(t)  +  —  +  £  ■  x(t )  =  /(f),  t  >  0, 

^  +  X[-„,„](C(2:»A(i)  =  0,  t  >  0,  (14) 

with  the  initial  conditions  : 


z(0)  =  To, 

A(0)  =  A0, 

where  C  denotes  a  criterion  of  periodicity.  This  method  is  called  the  Limit  Absorption  Method  (LAM).  It  is  easy 
to  see  that  if  A0  =  0,  then  A(£)  =  0 ,Vf  >  0  and  therefore  the  solution  of  (14)  is  exactly  the  solution  of  (1).  This 
method  is  called  the  Classical  Method  (CM). 

We  consider  a  periodic  term  source  /(£)  =  cos(wf).  The  periodic  state  of  (1)  is  given  by  : 

Xeoit)  =  (ecos(wi)  +  u«m(«f))/(£2  +  w2),  t  >  0.  (15) 

We  compare  the  numerical  solutions  obtained  for  e  =  0.01  and  e  =  0.0001.  using  the  two  methods.  In  the  first 


:KAAAAAAAAAy 


6204  1:105 


Figure  1:  Convergence  to  the  periodic  states  by  the  Classical  Method 
(left  :  e  =  0.01  /  right  :  e  =  0.0001 ) 

case  (  e  =  0.01)  a  good  approximation  of  the  periodic  state  is  obtained  after  70  periods  using  the  (CM)  and  after  7 
periods  using  the  (LAM).  In  the  second  case  (  £  =  0.0001),  a  very  long  time  is  required  to  approximate  the  periodic 
regime  using  the  (CM)  (about  6000  periods),  whereas,  using  the  (LAM),  a  good  approximation  is  obtained  only 
after  7  periods. 
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Figure  2:  Convergence  to  the  periodic  states  by  the  LAM 
(left :  t  =  0.01  /  right  :  e  —  0.0001,) 


2  Application  of  the  LAM  to  the  Maxwell  system 


We  are  concerned  now  with  the  resolution  of  the  scattering  problems  for  harmonic  planar  waves  by  two  dimensional 
purely  reflecting  non-convex  obstacles.  Numerical  results  obtained  by  the  two  methods  introduced  above  are 
compared.  Using  the  methodology  of  the  previous  section,  the  Maxwell  equations  write  : 


f  A (t)H  +  no  +  rot(E)  —  0, 
[  A (t)E  +  e0~-rot(H)  =0, 


(16) 


where  (£,H)  denotes  the  electro-magnetic  field,  e0  and  Mo  axe  respectively  the  permittivity  and  the  permeability 
of  the  vacuum  and  ft  is  the  computational  domain.  Boundary  conditions  are  imposed  on  T  =  Tmet  U  T*,  where 
TmtU  are  respectively  the  metallic  and  artificial  boundaries.  The  time  evolution  of  the  absorption  parameter 
X(t)  is  given  by  the  control  equation: 


—  +  X[-,,,](C(JJ,  E))  ■  m  =  0,  t  >  o,  (17) 

where  C  represents  a  criterion  of  periodicity.  Since  the  Maxwell  system  is  conservative  and  hyperbolic,  we  choose 
the  numerical  approximation  based  on  finite-volume  upwind  schemes  [6,  3]. 

We  present  now  the  criterion  of  periodicity.  Suppose  that  the  solution  is  an  harmonic  wave  : 

Q{t,x)  =  Q0(x)cos(ut  +  ip). 


Then,  we  have  : 


Performing  integration  with  respect  to  t  and  x,  we  get  : 


/  {jcHf)  +uj2Q2dx  =  amstant' 


which  leads  to  the  following  criterion  of  periodicity  : 


C1  = 


JaX (f  )2  (ndt)  +  jQHndt)  -  (f  )2  ((n  -  1)*)  -  u ;2Q2((n  -  l)dt)}dx 
Jfi{(§)2  ((”  -  1  )dt)  +  u*Q2({n  -  1  )dt)}dx 
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If  the  solution  is  not  an  harmonic  wave,  the  following  more  general  criterion  can  be  used  : 

CHQtnT))  - 

{Q  l'  faQ’((n-l)T,x)dx 

c,lo(nT))  Jnl0(nr,x)-g((n-i)r,^)pdx 

C  (QlnT))  -  JaQi((„  -  i)T,x)ix  '  ~  ' 

The  numerical  experiment  concerns  a  perfectly  conducting  circular  cavity  with  an  open  crack,  and  we  consider  the 
transverse  magnetic  (TM)  problem  on  a  domain  SI  =]0,  llm[x]0,  llm[: 


Figure  3:  Circular  Cavity 


The  incident  field  is  given  by  the  formulas  : 

* = -  +wrMt  -  t”’  £-  -  “s(“(t  - 


kxX  +  kyy^ 
c|fc| 


with  |jfc|  =  lj/c  =  *V2,  kx  =  -|Jfc|,  ky  =  0.  The  mesh  used  is  finite-element  type  and  nonstructured  (triangles). 
The  number  of  points  per  wavelength  (A  =  \/2)  is  approximately  15.  Figures  (4)  shows  the  component  Hx  of 


Figure  4:  Hx  component  of  the  magnetic  field 
(left :  Classical  Method  /  right  :  LAM) 

the  magnetic  field.  The  solution  calculated  by  the  (CM)  (A  =  0)  doesn’t  converge  even  after  300  periods,  whereas, 
using  the  (LAM),  good  solutions  are  obtained  after  50  periods.  Figure  (5)  shows  the  convergences  to  zero  of  the 
criterion  of  periodicity  and  the  time  evolution  of  the  absorption  parameter  A(t),  which  tends  also  to  zero. 
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Figure  5:  Criterion  of  periodicity  using  the  LAM/  Evolution  of  parameter  \(t) 


3  Application  of  the  LAM  to  the  transport  of  charged  particles 

The  motion  of  a  fluid  of  charged  particles  placed  in  an  electro-magnetic  field  can  be  described  in  terms  of  particle 
distribution  function  by  the  Vlasov  equation  : 

dtf  +  v-  Vx/  4-  —{E  +  v  A  (fiol T))  •  V,/  =  0.  (18) 

m 

The  electro-magnetic  field  (E,H)  verifies  the  Maxwell  equations.  Using  the  (LAM),  these  equations  write  : 


f  Mo  ™  +  rot(E)  = 

[  -rot(H)  =-\(t)E- 1 


(19) 


where  the  current  density  is  given  by: 


J(t,x)  =  j  vf{t,x,v)dv. 


(20) 


The  Vlasov  equation  is  numerically  solved  by  means  of  a  deterministic  particle  method  [5].  For  the  numerical 
resolution  of  the  Maxwell  equations,  we  use  the  same  scheme  as  in  the  previous  section. 

In  the  following  we  consider  the  transverse  electric  mode  on  a  domain  9  =]°’  lm[x]0,  lm[.  Two  kinds  of  charged 
particles  are  injected  on  the  right  side  of  the  box: 


and  : 


ge(t,x,v)  =  | 
9i(t>  x,v)  =  | 


—(2  -I-  cos(wt))  ■  6(u  -  3), 

0, 


+(2  -F  sin(ut))  ■  6(v  -  3). 

0, 


x  =  0,  vx  >  0, 
otherwise, 


x  =  0,vx  >  0. 
otherwise, 


where  u  -  2-v  =  (resonant  frequency  of  the  box).  Figure  (6),  (7)  shows  the  electro-magnetic  energy  of  the 

solutions  and  the  H z  component  of  the  electro-magnetic  field  calculated  by  the  two  methods.  The  good  behavior  of 
the  solution  calculated  by  the  (LAM)  can  be  seen,  whereas  the  (CM)  seems  to  be  non  adequate  for  the  computation 
of  the  periodic  solutions. 
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Figure  6:  Electro-magnetic  energy 


Figure  7:  Component  Hz  of  the  electro-magnetic  field 
(left :  CM  /  right  :  LAM) 


4  Conclusion 

We  have  presented  a  controllability  method  for  the  calculation  of  time-periodic  solutions.  The  key  point  consist 
of  an  absorption  parameter  A(t).  In  a  reduced  ID  model  we  have  shown  that  using  this  method  allows  us  to 
spare  time  without  loss  of  accuracy.  This  methodology  was  applied  to  the  resolution  of  scattering  problems  for 
electro-magnetic  waves  and  to  the  transport  of  charged  particles  in  electro-magnetic  field. 
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Introduction 

The  use  of  high  frequency  (HF)  techniques  in  the  frequency-domain  scattering  can  hardly  be 
overemphasized.  The  results  obtained  by  Geometrical  Theory  of  Diffraction  (GTD)  and  its  uni¬ 
form  versions  have  widely  been  used  in  predicting  the  time-domain  responses  of  targets  through 
inverse  Fourier  transformation  [1,  2,  3]. 

The  direct  application  of  high  frequency  techniques  in  the  time  domain  (TD)  has  been  dis¬ 
cussed  and  proposed  in  [4],  however  only  recently,  this  interest  has  resurfaced  [5,  6,  7].  The 
advantages  are  several:  efficient  and  faster  computation  than  inverse  Fourier  transformation  of 
frequency  domain  solution,  more  suitable  solutions  when  the  pulse  width  is  narrow  compared  with 
the  geometrical  dimensions  of  the  scattering  object,  feasibility  to  implement  a  hybrid  solution  by 
combination  with  various  numerical  time  domain  methods  such  as  FDTD  and  TLM. 

Obviously,  these  HF-based  time-domain  solutions  are  not  accurate  when  the  input  pulse  has 
strong  low  frequency  components.  The  other  difficulty  in  applying  GTD  (and  its  uniform  versions) 
in  the  TD  is  that,  GTD  requires  a  convolution  of  the  input  field  with  the  so  called  ’’time-domain 
diffraction  coefficients”.  Some  suggestions  have  already  been  made  to  make  this  convolution 
more  efficient  [7].  However,  equivalent  edge  currents  (EEC)  which  are  widely  employed  in  the 
frequency-domain  do  not  appear  to  be  exploited  directly  in  the  time-domain.  In  addition  to 
all  the  advantages  in  the  frequency-domain,  EEC  approach  in  the  time-domain  does  not  require 
any  formal  convolution  integral  with  the  incident  field.  So,  in  this  paper  the  time-domain  EEC 
(TD-EEC)  have  been  derived,  and  employed  for  the  scattering  from  perfectly  conducting  planar 
geometries. 


Derivation  of  time-domain  equivalent  edge  currents 


In  the  physical  optics  (PO),  the  surface  currents  are  approximated  through  geometrical  optics; 
namelv  they  are  twice  the  tangential  component  of  the  incident  magnetic  field  on  the  illuminated 
side,  and  zero  on  the  shadowed  side.  The  far-zone  field  is  then  found  from  the  radiation  integral 
of  these  currents  as: 


EPO 


-^—s  x  s  x  [  f  2 n  x 

47 TTC  J  J S (illuminated) 


dVnc{r  -  r7  •  s’/c) 
dt 


ds' 


xOn  leave  from  Bilkent  University,  Dept.  Elect.  Eng,  Bilkent  06533  Ankara,  Turkey 


(1) 
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where  r  -  (t  -  ^  + ~),  Z0  is  the  free  space  impedance,  c  is  the  velocity  of  light  in  vacuum,  s'  and 
s  are  the  unit  vectors  for  the  incident  and  radiation  direction  respectively,  and  finally  n  is  the 
unit  normal  to  the  surface.  It  is  noted  that  the  PO  surface  current  is  retarded  by  the  observation 
distance  and  additional  path  at  any  source  point  ¥'.  This  approach  has  been  applied  and  discussed 
in  [5]  mainly  for  the  radiation  from  reflector  antennas.  For  planar  perfectly  conducting  scatterers 
and  for  far  field  observation,  the  PO  surface  integration  in  (1)  can  be  reduced  to  a  contour 
integration  via  Stokes’  theorem.  The  resulting  expression  is  given  as: 


(s'  -  s)  •  (/  -  azaz ) 


es  =  —s  x  s  x  &  h  x  7T’ 

nr  Jc  |  (s'  -  s)  •  (/  -  azaz )  | 


■Idl 


(2) 


where  the  scatterer  is  assumed  to  lie  on  the  zy-plane,  az  is  the  unit  vector  in  ^-direction  and  / 
is  the  unit  dyad.  By  comparing  this  expression  with  the  radiation  integral  of  equivalent  electric 
and  magnetic  line  sources  on  the  edge  contour  given  by 


= il 7*'sx  l£/(r)“+ ik1  x  I/cm(t)“ 


(3) 


one  obtains 

I(r)  =  /  [z;lDl?nc{t')  - 1  +  Dih^it')  •  l]c  dt'  (4) 

J- 00 

and  fT 

M(r)  =  /  [Z0D??Vnc(t')  •  l]c  dt'  (5) 

J-o o 

where  D[,  D{  and  Djf  are  the  angle  dependent  coefficients.  It  is  noted  that  the  scattered  field 
is  given  in  terms  of  an  edge  contour  integration  and  requires  no  convolution  integral.  Also,  since 
the  consequtive  time  derivative  and  integrations  cancel,  it  is  directly  related  to  the  incident  field 
rather  than  its  derivative.  The  coefficients  D[,D\ •,  and  Dff  are  exactly  the  same  as  the  ones  in 
the  frequency  domain  EEC  [8],  and  they  contain  a  PO  part  and  a  fringe  part.  The  PO  part  is 
given  by  the  following  coeficients[9] 

/  F0  _  _ sin  (sin  cos  <j>  +  sin  ft'  cos  <f>') _ 

e  “  ~2  [(cos  / 3  -  cos  0')2  +  (sin  /3  cos  <j>  +  sin  ft  cos  <£')2]  sin  /?' 


D, 


,I,PO 

h 


=  2 


(cot/3  cos  <f>  +  cot/3'  cos  ft)  (sin  /3cos<ft  +  sin  /3'  cos  ft) 
[(cos  -  cos  /3')2  +  (sin  /3  cos  <f>  +  sin  /3'  cos  <f/)2] 


(7) 


dm,po  _  2 _ sin  (ft(sin  0  cos  </>  +  sin  /3'  cos  </>') _  ,gx 

h  [(cos (3  —  cos  f3')2  +  (sin 0  cos  0  +  sin  0'  cos <f>')2]  sin/3 

where  the  angles  /3',  /3,  (/>',  and  <j>  are  the  diffraction  angles  as  shown  in  Figure  1.  f3'((3)  is  the 
angle  between  the  edge  tangent  and  the  incident  (scattering)  direction.  </>'(</>)  is  measured  on 
the  normal  plane  from  the  plate  surface  to  the  projection  of  the  incident  (scattering)  direction. 
These  coefficients  when  substituted  into  the  contour  integration  in  (3)  exactly  recover  the  PO 
surface  integration.  Using  the  developments  in  the  frequency  domain  [10],  one  can  improve  the 
equivalent  current  formulation  over  PO  by  adding  the  non-uniform  or  fringe  current  contribution. 
The  coefficients  for  this  contribution  are: 
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Figure  1:  Definition  of  the  diffraction  angles  for  an  edged  geometry. 


[f  _ sin ((^72) (7  -  sin/3'cos(</>72)) _ _ 

De'  =  _4sin  /3'[sin /3'{ sin /3  cos <f>  +  sin  f3' cos <f>')  +  cos (3' (cos /3  -  cos P’)\ 

j  j  sin  (3'[cotf3  cos  (j)  +  cot  ft'  cos  <f>'  +  cos(^>72)7  1C] 

=  "2  sin  P'(sm  0  cos  (j)  +  sin  /?'  cos  <f>')  +  cos  /3'( cos  /?  -  cos /?') 

M  j  - _ sin /?' sin  </>[!  -  sin /3' cos{<t>' /2)'',~1] _ 

D‘"  ’  =  _2sin  /3[sin  /3'(sin  /3cos(f>  +  sin  /?'  cos  </>')  +  cos  /3'(cos  /3  -  cos  /?')  j 

7  =  [(sin  /3'(sin $  -  sin 0  cos  <f>)  -  cos  /3'(cos  £  -  cos  /3'))/2]  * 


(9) 

(10) 
(11) 
(12) 


C  =  cot/?' [sin /3 cos 0  +  co0{ cos (3  -  cos/3')]  -  sin  01  cot/3  cos 0.  (13) 

The  substitution  of  the  sum  of  these  coefficients  in  (6-8)  and  (9-11)  into  (3)  should  yield  the  first 
order  diffraction  from  the  planar  scatterer. 


Results 

A.  TLM  calculation  of  a  gaussian  pulse-excited  dipole  in  the  presence  of  a  metallic  enclosure 
(see  Figure  2)  has  been  given  in  [11].  The  metallic  box  has  a  height  of  10  cm,  a  width  of  34 
cm  and  a  depth  of  10cm.  The  dipole  is  directly  in  front  of  the  box  with  a  separation  of  55 
crm  The  orientation  of  the  dipole  is  vertical  and  the  scattered  fields  in  the  vertical  direction  are 
considered.  There  are  also  two  small  apertures  on  the  front  wall  of  the  box  which  cause  some 
interior  resonances  to  occur.  Backscattering  and  resonances  of  the  metallic  enclosure  which  is 
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Discretized  region  of  the  TLM  method 


Figure  2:  A  short  dipole  antenna  interfering  with  metallic  enclosure. 

interfered  by  the  antenna  at  point  Pi  in  distance  of  5  cm  from  the  aperture  is  computed  using 
TLM  with  a  metallic  wall  thickness  of  5  mm.  The  same  geometry  is  also  analyzed  using  PO,  in 
which  case  apertures  on  the  front  wall  are  neglected  so  the  surface  currents  are  assumed  to  flow 
on  the  perfectly  conducting  front  wall.  The  results  are  shown  in  Figure  3  and  the  comparison  is 
generally  good  up  to  about  3  nanoseconds.  After  this  time,  higher  order  diffraction  and  interior 
resonances  take  place  which  are  ignored  in  the  PO.  Since  the  observation  point  Pi  is  quite  near 
to  the  enclosure,  the  radiation  contribution  due  to  divergence  of  the  PO  surface  current  is  also 
included  in  the  calculations,  which  improved  the  calculations  slightly. 

The  second  geometry  studied  is  a  square  plate  for  which  a  space-time  integral  equation  solution 
is  given  in  the  literature  [12].  The  incident  field  is  a  plane  wave  of  gaussian  shape  given  by 

einc(t)  =  -Le-W*)2  (14) 

y/TTa 

where  o  is  taken  as  1.11c"1  (not  specified  in  [12]). 

The  axial  backscatter  (normal  incidence)  plots  using  TD-EEC  of  (3)  are  calculated  and  com¬ 
pared  with  the  space-time  integral  equation  approach  in  Figure  4.  The  second  positive  pulse  in 
the  solution  of  [12]  is  due  to  "waves  traveling  across  the  plate  face  and  returning  to  the  observer”. 
This  higher  order  effect  is  not  included  in  the  TD-EEC  analysis. 

Note  that  in  this  geometry,  a  direct  application  of  GTD  is  not  possible  due  to  caustic  problem. 
It  should  be  mentioned  that  another  reason  for  the  poor  comparison  in  Figure  4  is  because  of  the 
strong  low  frequency  component  of  the  input  pulse.  (In  the  previous  example,  the  incident  field 
coming  from  the  dipole  was  proportional  to  the  second  derivative  of  a  gaussian  (such  as  in  (14)), 
so  the  low  frequencies  were  suppressed). 
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Scattered  H-field 


Til 


nanoseconds 


Figure  3:  Comparison  of  the  field  for  the  metallic  box.  Solid  line:  PO,  dashed  line:TLM 


Figure  4:  Gaussian  pulse  response  of  a  square  plate  of  size  1m.  The  solid  line  is  the  TD-EEC 
and  the  dashed  lines  are  space-time  integral  equation  calculations. 


Conclusions 


Time  domain  equivalent  edge  currents  are  derived.  Their  limitations  and  theoretical  advantages 
over  the  other  high-frequency-based  time-domain  approaches  are  discussed.  The  applications  for 
the  planar  perfectly  conducting  scatterers  are  presented. 
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Abstract.  In  this  paper,  we  investigate  different  mappings  between  the  discretized  electromagnetic  fields 
and  the  state  variables  of  the  three-dimensional  TLM-SCN  scheme.  The  different  mappings  are  studied 
with  respect  to  their  suitability  to  the  solution  of  a  numerical  initial  value  problem  (IVP).  The  convergence 
rate  of  the  relative  error  between  a  benchmark  solution  and  the  solution  obtained  by  simulations  with 
coarser  discretizations  is  studied.  It  is  shown,  that  both  the  TLM-SCN  method  without  stubs  for  modelling 
homogeneous  media  and  the  TLM-SCN  method  with  stubs  for  modelling  inhomogeneous  media,  converge 
with  second  order  accuracy,  if  the  initial  value  (IV)  is  properly  applied.  In  the  case  of  TLM-SCN  without 
stubs,  it  emerges  that  imposing  the  initial  condition  at  the  boundary  of  the  TLM  cell  and  also  sampling 
the  fields  at  the  boundary,  yields  a  much  smaller  error  constant.  This  result  shows,  that  the  accuracy  of 
the  TLM  scheme  is  not  only  dependent  upon  the  dispersion  of  the  SCN  mesh,  but  also  influenced  by  the 
mapping  between  discretized  fields  and  TLM  state  variables.  However,  imposing  the  initial  condition  at 
the  boundaries  of  the  TLM  cells  in  a  TLM-SCN  network  with  stubs  results  in  the  excitation  of  spurious 
modes.  The  reason  for  this  is  that  the  initial  states  of  the  stubs  are  not  properly  set. 


1.  Introduction 

In  previous  work,  the  dispersion  and  coarseness  error  and  the  performance  of  absorbing  boundary 
conditions  of  the  SCN  -  TLM  method  were  studied  [9, 10].  The  influence  of  the  mapping  between  the 
discretized  electromagnetic  fields  and  the  TLM  state  variables  on  the  error  of  the  numerical  solution 
was  not  subject  of  these  studies.  In  this  paper,  we  investigate  the  influence  of  the  mapping  between 
the  discretized  electromagnetic  fields  and  the  TLM  state  variables  on  the  convergence  of  various  TLM 
schemes. 

The  original  formulation  of  the  symmetrical  condensed  node  based  on  network  analogies  was  given  by 
Johns  [6].  The  initial  field  is  imposed  on  the  TLM  network  at  the  node  and  the  discretized  electromagnetic 
fields  are  sampled  at  the  node  as  a  function  of  the  incident  TLM  state  variables.  Later,  this  formulation 
was  extended  to  include  losses  [11]. 

In  the  literature,  various  other  mappings  between  the  sampled  electromagnetic  field  quantities  and 
the  TLM  state  variables  were  introduced.  In  a  Method  of  Moments  based  derivation  of  the  SCN,  the 
initial  field  is  imposed  on  the  TLM  network  and  the  fields  are  sampled  at  the  boundaries  of  a  TLM  cell 
[7].  By  this  definition,  an  isomorphic  mapping  between  the  24  discretized  electromagnetic  fields  and  the 
24  state  variables  at  the  boundary  (taking  both  incident  and  reflected  states  into  account)  of  a  TLM 
cell  is  possible.  Essentially  the  same  mapping  (apart  from  a  normalization  to  power  waves)  between 
discretized  field  and  TLM  states  was  used  in  a  derivation  of  the  SCN  based  on  Finite  Integration  [1]. 
The  finite  difference  interpretation  of  the  SCN  given  by  [2]  can  be  interpreted  as  imposing  the  IV  at  the 
boundary  and  sampling  the  fields  at  the  centre  of  a  cell.  Here,  the  mapping  between  the  discretized  field 
and  the  TLM  states  at  the  boundary  is  the  same  as  in  [7],  although  a  different  notation  is  used.  At  the 
node,  Johns’  original  mapping  between  the  incident  state  variables  and  the  discretized  field  is  used.  For 
the  SCN  with  stubs,  a  finite  difference  interpretation  following  [5],  and  an  interpretation  of  a  discrete 
propagator  integral  [4]  exist. 
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In  this  paper  we  investigate  the  different  methods  of  imposing  the  initial  field  on  the  TLM-SCN 
network  and  the  different  ways  of  sampling  the  electromagnetic  field  with  respect  to  their  convergence 
following  [4, 6, 7, 11].  In  the  next  section,  the  different  mappings  are  shown. 


2.  Comparison  of  various  mappings 

Using  the  notation  presented  in  [12],  the  discretized  electromagnetic  fields  and  the  TLM  state  variables 
at  the  boundary  of  a  TLM  cell  are  related  by 
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where  a  denotes  incident  TLM  pulses,  6  reflected  TLM  pulses,  and  k,l,m,n  are  temporal  and  spatial 
indices  of  the  respective  TLM  cell.  I  denotes  the  12  by  12  unity  matrix.  Z  is  the  impedance  of  free 
space. 

At  the  node,  we  have  to  distinguish  the  mappings  of  the  various  authors.  For  Johns’  formulation  of 
the  SCN  without  stubs,  the  relation  between  electromagnetic  fields  and  TLM  state  variables  at  the  node 
reads  with  /  =  [Ex,  Ey,  Ez,  ZHX ,  ZHy,  ZHZ)T 
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If  the  SCN  has  stubs,  the  mapping  is  as  follows,  where  in  the  case  of  the  SCN  following  [6]  without 
losses,  Gx,  Gy,  Gz,  Rx,  Ry,  Rz  =  0: 
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with  the  coefficients 
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internal  degrees  of  freedom  and  consequently  not  mapped.  It  should  be  stressed,  that  it  was  assumed  in 
[4],  that  when  the  TLM  scheme  is  started,  there  must  be  no  initial  field  present.  At  the  cell  boundary, 
the  mapping  between  the  discretized  electromagnetic  fields  and  the  TLM  state  variables  is  given  by  (1). 
The  fields  at  the  node  are  taken  as  the  averages  of  the  fields  at  the  boundaries,  i.e. 
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3.  Numerical  analysis  of  convergence 

To  compare  the  convergence  of  various  TLM  schemes,  the  propagation  of  a  linearly  polarized  Gaussian 
pulsed  plane  wave  in  an  axial  direction  of  the  TLM  mesh  was  analyzed  numerically.  Fig.  1  shows  the 
investigated  structure.  For  a  plane  wave,  linearly  polarized  in  z-direction,  propagating  in  x-direction,  a 
linear  row  with  a  single  TLM  cell  in  the  transverse  directions  is  representative.  The  ports  of  the  TLM 
cells  are  short-circuited  in  z-direction  and  open  in  y-direction.  Cubic  TLM  cells  were  considered.  The 


6B0 


no. 

Alno  [A vm] 

Nno 

Mno 

1 

1.0 

70 

6 

2 

0.5 

140 

12 

3 

0.25 

280 

24 

4 

0.125 

560 

48 

5 

0.015625 

4480 

384 

Table  1 .  Parameters  of  different  discretization  levels. 


size  of  a  TLM  cell  is  A l.  The  initial  field  is  distributed  across  the  distance  Axex.  The  distance  between 
initial  field  and  observation  point  is  denoted  Ax. 

The  numerical  IVP  was  solved  with  various  discretisations.  The  respective  simulation  parameters  can 
be  found  in  Table  1.  Fig.  2  shows  schematically  the  TLM  models  for  two  subsequent  discretization  steps. 
The  upper  sketch  refers  to  the  case  where  the  fields  are  sampled  at  the  node,  and  the  sketch  at  the 
bottom  refers  to  the  case  where  the  fields  are  sampled  only  at  the  boundaries  of  a  TLM  cell. 

The  initial  field  distribution,  given  by  the  discretized  truncated  Gaussian  pulse 

El-^x)  —  / Eoe  ^  ^  for  Zty-Alno  <x  —  xo<  + 1)  Alno  (9) 

[  0  elsewhere 


is  distributed  across  the  distance  Axex  =  NnoAlno.  The  region,  where  the  initial  field  is  given,  is 
symbolized  by  the  shaded  cells  in  Fig.  2.  The  pulse  is  centred  around  Axq  =  (Nno/2  +  l)AZno,  i.e. 
around  a  node  in  the  one  case  or  a  boundary  between  two  cells  in  the  other  case  (see  the  dashed  line  in 
Fig.  2).  The  width  of  the  pulse  is  \/2cr  =  NnoAlno/p,  where  p  was  chosen  3\/2. 

The  overall  length  of  the  structure  was  chosen  such,  that  in  connection  with  the  duration  of  the 
simulation  no  reflections  from  the  terminating  walls  in  the  positive  and  negative  x-direction  occured  at 
the  observation  point. 

In  the  case  with  stubs,  a  medium  with  et  =  2,  and  pT  =  1  was  assumed.  In  the  case  of  a  lossy  medium, 
the  electric  conductivity  was  chosen  isotropically  ae  —  305m-1. 

If  one  reduces  At,  A 1,  the  deviation  between  the  numerical  solution  for  finite  At,  A l  and  the  exact 
solution  decreases  by  the  order  of  accuracy  of  the  method  [13].  In  this  experiment,  the  exact  solution 
was  given  by  a  benchmark  solution,  obtained  from  a  simulation  with  very  small  cell  size  (no.  5  of  Table 
!)■ 

The  relative  error  between  reference  and  simulated  solution  is  given  by 


V  = 


£t(kla  +  |v««*p) 

, bench  l2  +  l^®W 
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with  uk  =  B^beIlch  -  E|. 


(10) 


Vt  denotes  the  backward  difference  between  two  neighbouring  elements  devided  by  the  time  step  of 
the  respective  discretization  level,  to  approximate  the  temporal  partial  derivative.  The  relative  error  is 
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Figure  2.  TLM  models  for  two  subsequent  discretization  steps. 


calculated  using  the  norm  of  an  adequate  discrete  Sobolev  Space,  where  the  approximations  of  the  first 

partial  derivatives  are  also  considered  [14].  c-vic 

Table  2  indicates  the  different  investigated  mappings  between  the  discretized  electromagnetic  fields 

and  the  TLM  state  variables. 

Table  2:  Location  of  excitation  and  field  sampling  of  different  TLM  -  SCN  inter¬ 
pretations. 

~  no.  Excitation  Sampling  Stubs  lossy  Author 

A  node  node  no  -  [6] 

B  boundary  boundary  no  _ [7] 

~ C  node  node  yes  no  [6] 

D  node  node  yes  yes  [11] 

E  boundary  boundary  yes  no  [4] 


4.  Results 

4.1.  TLM  without  stubs.  Fig.  3  shows  the  relative  error  n  for  TLM  without  stubs  (case  A  and  B 
Table  2).  In  case  A,  when  the  initial  field  is  mapped  to  the  TLM-SCN  network  at  the  node  and  the  fields 
are  also  sampled  at  the  node,  one  can  recognize  a  second  order  convergence  for  coarser  ^*^io^ 
However,  this  result  shows,  that  the  mapping  between  the  discretized  fields  and  the  TLM  states  has 
an  influence  on  the  accuracy  of  the  TLM  solution,  as  the  SCN  mesh  without  stubs  exhibits  no  dispersion 

m  Sughrethe°snecond  order  convergence  corroborates  the  results  of  other  work  where  an  overall  second 
order  convergence  was  predicted  with  the  aid  of  the  dispersion  relation  of  the  TLM  mesh  [8  10J- 
In  case  B,  when  the  initial  data  are  imposed  on  the  TLM-SCN  network 
are  also  sampled  at  the  boundary,  a  much  smaller  relative  -  error  rj  is  observed.  As  the  TLM-SCN  schem 
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Figure  3.  The  relative  -  error  of  the  Ez  field  of  SCN-TLM  without  stubs  for  various  discretizations. 


Figure  4.  Excitation  of  spurious  modes,  when  applying  an  IV  at  k  =  1  at  the  cell 
boundary  and  field  sampling  at  the  cell  boundary  (case  E  of  Table  2). 

without  stubs  has  no  dispersion  in  the  axial  directions,  we  only  see  the  numerical  round-off  error  (we 
worked  with  single  precision  arithmetic),  as  expected. 

4.2.  TLM  with  stubs.  First,  we  want  to  have  a  look  what  happens  when  we  impose  the  initial  field  at 
the  boundaries  of  the  TLM  cells  with  stubs.  If  we  map  the  fields  at  the  boundary  (Case  E,  Table  2),  we 
observe  the  excitation  of  spurious  modes  (see  Fig.  4) .  This  is  due  to  a  wrong  initialization  of  the  stubs. 

However,  it  was  stressed  in  [4],  that  the  initial  electric  and  magnetic  fields  have  to  be  zero,  when  the 
TLM  scheme  is  started.  If  this  condition  is  neglected,  one  will  inevitably  excite  spurious  modes.  The 
excitation  of  spurious  modes  when  imposing  an  initial  field  at  the  boundary  was  also  observed  by  others 
[3]. 

Besides,  sampling  the  fields  at  the  node  when  imposing  the  initial  field  at  the  boundary  [4],  averages 
out  most  of  the  spurious  modes  (see  Fig.  5),  which  is  a  bit  surprising. 

No  spurious  modes  where  observed,  when  imposing  the  IV  on  the  TLM-SCN  network  in  the  manner 
proposed  by  Johns.  This  is  due  to  also  exciting  the  stubs  correctly  and  not  treating  them  as  internal 
degrees  of  freedom  as  in  [4]. 

As  it  makes  no  sense  to  study  the  convergence  when  spurious  modes  are  excited,  we  have  only  compared 
the  convergence  rates  of  case  C  and  D  of  Table  2,  i.e.  with  and  without  ohmic  losses.  The  results  are 
compiled  in  Fig.  6. 
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Figure  5.  Excitation  with  spatial  field  distribution  at  k  —  1  at  the  cell  boundary.  Field 
sampling  at  the  node  averages  out  most  of  the  spurious  modes  (case  E  of  Table  2). 


Figure  6.  The  relative  -  error  of  the  Ez  field  of  SCN-TLM  with  stubs  for  various 
discretizations  when  exciting  at  the  node  (cases  C  and  D  of  Table  2). 

For  TLM  with  stubs  following  Johns’  and  Naylor  et  al.’s  interpretation  (case  C  and  D),  the  error  de¬ 
creases  with  second  order  for  coarser  discretization  levels,  as  predicted  by  [5].  However,  we  can  recognize 
a  flattening  of  the  error  curve  towards  smaller  cell  sizes. 

5.  Conclusion 

We  have  investigated  the  accuracy  of  different  mappings  between  the  discretized  electromagnetic  fields 
and  the  TLM  state  variables  when  solving  a  discrete  IVP  with  the  TLM-SCN  algorithm.  It  emerged, 
that  the  mapping  between  discretized  electric  and  magnetic  field  components  and  TLM  state  variables 
influences  the  accuracy  of  the  TLM  solution  as  well  as  the  dispersion  behaviour  of  the  TLM-SCN  mesh. 

Considering  a  TLM-SCN  mesh  without  stubs,  imposing  the  initial  values  at  the  boundary  of  the  TLM 
cells  and  also  sampling  the  fields  at  the  boundary,  yields  an  error  of  the  order  of  numerical  round-off 
errors  for  the  axial  -  directions,  as  expected  by  inspecting  the  dispersion  relations  of  the  SCN  mesh 

without  stubs.  , 

Imposing  the  initial  field  distribution  at  the  node  and  sampling  the  fields  at  the  node,  we  observed 
a  second  order  convergence  in  axial  direction  and  not  a  dispersion  free  propagation,  which  indicates 
the  influence  of  the  mapping  between  discretized  fields  and  TLM  state  variables  on  the  accuracy  of  the 
numerical  solution. 
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In  the  case  of  a  TLM-SCN  mesh  with  stubs,  we  observed  second  order  convergence  in  axial  direction, 
when  the  fields  are  sampled  at  the  node  and  the  initial  values  are  imposed  at  the  node,  as  in  Johns’ 
original  formulation.  The  observed  flattening  of  the  error  curve  towards  finer  discretizations  is  subject 
to  further  studies. 

Imposing  the  initial  field  on  a  TLM-SCN  network  with  stubs  only  at  the  boundary  results  in  the 
excitation  of  spurious  modes.  This  is  due  to  a  wrong  initialization  of  the  stub  states.  Therefore  we  can 
apply  this  mapping  only  in  cases  where  we  have  initial  values  zero,  i.e.  all  electric  and  magnetic  field 
components  axe  initially  zero  (this  was  stressed  in  [4]).  The  excitation  of  the  TLM  network  has  then 
to  be  imposed  in  the  sense  of  an  initial-boundary  value  problem,  where  one  specifies  the  electric  and 
magnetic  field  components  at  the  boundaries  of  some  arbitrary  TLM  cells  as  a  smooth  function  of  time. 

This  means,  that  if  we  have  initially  a  field  free  space,  we  can  choose  freely  between  imposing  the 
initial  values  at  the  node  or  at  the  boundary,  without  decreasing  the  convergence  order. 

If  one  has  an  initial  field  distribution  unequal  zero,  one  has  to  use  the  mapping  between  the  discretized 
field  distribution  and  the  TLM  states  as  proposed  by  Johns  or  Naylor  et  al.  in  case  of  modelling  a  lossy 
medium. 
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Introduction 

Radiofrequency  (RF)  exposure  of  metallic  objects  can  induce  differential  heating  of 
neighbouring  biological  tissues.  Conductors  may  be  located  external  to  the  body  or  within  the 
body's  tissues.1  Fig  la  shows  a  sternum  implant  following  open-heart  surgery  and  Fig.  lb  an 
implant  resulting  from  a  fracture  [Joyner  et  al.  1988]  [Fleming  et  al.  1992].  When  an  implant  is 
illuminated  by  a  RF  field  the  absorbed  power  of  the  incident  field  may  be  preferentially 
redistributed  at  particular  points  adjacent  to  the  implant.  In  such  cases,  the  resonant  response  of 
the  implant  requires  determination.  This  report  examines  the  electric  field  and  RF  currents 
associated  with  a  rod-type  wire  implant  embedded  in  muscle. 


Fig.  la:  Sternum  implant  Fig.  lb:  Ankle  implant 

A  common  belief  is  that  electric  fields  and  derived  quantities  such  as  specific  absorption 
rate  (SAR)  are  resonant  when  the  implant  length  is  a  half- wavelength  in  tissue,  ie  A.j/2. 
However,  in  this  report  it  is  found  that  electric  fields  and  other  derived  quantities  are  resonant  at 
lengths  shorter  than  XT/2.  This  occurs  because  currents  on  implants  are  influenced  by  the  highly 
conductive  nature  of  the  surrounding  tissues  [King  and  Harrison  1969],  Apart  from  its 


1  Conductors  surgically  positioned  within  the  body  are  termed  implants  as  distinct  from  electrodes,  to  indicate 
that  their  shape  and  material  are  not  designed  for  any  short-term  electrical  effect,  but  can  be  used  as  a 
biologically  inert  agent  within  the  body,  eg  as  a  brace  during  recalcification  of  bone  fractures,  in  place  for 
extended  periods  up  to  ten  years  or  more.  In  this  report,  we  use  the  term  wire  implant  to  indicate  the  origin  of 
the  study. 
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theoretical  interest,  this  finding  has  relevance  for  effective  hyperthermia  design  [Iskander  and 
Tumeh,  1989]  and  in  the  assessment  of  RF  heating  effects  around  metallic  implants. 

King  and  Harrison  [1969]  studied  rod  antennas  in  dissipative  media  from  a  theoretical 
viewpoint.  They  derived  analytic  expressions  for  currents  and  near  fields  of  wires  of  various 
lengths.  Also,  King  and  Iizuka  [1963]  obtained  the  currents  on,  and  fields  near,  quarter-  and 
half-wave  dipoles  in  a  series  of  saline  solutions.  Examination  of  input  admittance  as  a  function 
of  the  electrical  length  of  the  rod  antenna  reveals  that  as  the  conductivity  increases,  resonance 
occurs  (periodically)  at  less  than  X^/2  [King  and  Harrison  1969]. 

Fleming  and  Joyner  used  a  thin-wire  antenna  code  AMPC  to  study  wire-type  structures 
immersed  in  an  infinite  conducting  medium  [1989],  AMPC  was  validated  using  the  analytical 
solutions  and  experimental  results  obtained  by  King  and  Harrison  [1969]  and  King  and  Iizuka 
[1963],  MMP  a  general  purpose  semi-analytic  electromagnetic  code  based  on  the  multiple 
multipole  method  [Hafner  and  Bombolt  1993]  is  also  suitable  for  examining  the  fields  and 
currents  associated  with  thin  wire  structures.  In  the  following,  these  two  codes  have  been  used  to 
determine  the  induced  current  and  total  electric  fields  near  the  tip  of  a  particular  test  rod-type 
wire  implant.  This  leads  to  the  concept  of  a  ‘region  of  influence’.  The  effect  of  implant  tip 
shape  on  localised  SAR  enhancement  is  also  investigated. 

Method 

A  30  mm  rod  of  2  mm  radius  was  selected  as  the  wire  implant  test  case.  Both  codes, 
AMPC  and  MMP,  were  used  to  produce  volume-averaged  estimates  of  the  squared  magnitude  of 
the  total  electric  field  (incident  and  scattered),  |Ep,  over  a  1  cm3  volume  of  tissue  adjacent  to  the 
implant  tip  although  different  methods  were  used  in  determining  this  average  (see  Fig.  2),. 
AMPC  used  a  spherical  averaging  region  centred  on  the  tip,  while  MMP  used  a  cubic  averaging 
volume  that  abutted  the  tip  corresponding  to  a  1  gram  cube  of  tissue  [ANSI  1990],  The  square 
of  the  electric  field  is  directly  proportional  to  the  rate  of  absorption  of  RF  energy,  the  SAR.  A  1 
V/m  incident  field  was  incident  on  the  rod. 


Fig.  2  Rod  implant  30  mm  long  —  total  electric  field  magnitude  squared  was  averaged  over 
1  cm*  volume  of  tissue  next  to  tip.  AMPC  used  a  spherical  averaging  volume  centred  on  the 
tip,  while  MMP  used  a  Cartesian  volume  abutting  the  tip. 
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Estimates  of  the  currents  on  the  rod  and  the  electric  fields  at  the  implant  tips  were  sought 
at  1,  27.12,  40.68,  80,  200,  433,  600,  915,  2450,  5800,  10000  MHz.  These  frequencies 
include  those  commonly  used  within  the  bioeffects  community  [NCRP  1981].  Additional 
frequencies  at  80  (whole-body  resonance),  200  and  600  MHz  were  also  included  so  that  uniform 
coverage  was  obtained  in  the  range  1  -  10,000  MHz.  For  the  test  case,  the  30  mm  wire  implant 
was  modelled  to  be  immersed  in  muscle  (high  water  content)  tissue.  Tissue  dielectric  properties 
were  based  on  dielectric  data  obtained  from  the  NRCP  report  [1981]  (AMPC),  and  the  Hurt 
polynomials  [Dumey  et  al.  1986]  (MMP).  While  slightly  different  tissue  dielectric  parameters 
were  used  by  the  two  codes,  the  same  characteristic  results  for  the  currents  and  the  fields  are 
expected. 

The  wires  were  modelled  using  segment  lengths  that  were,  at  most,  a.t/10  for  the  tissue  in 
which  the  implant  was  assumed.  There  are  differences  in  the  way  in  which  the  two  codes 
segmented  the  model  of  the  rod.  Like  NEC,  AMPC  uses  a  thin-wire  kernel  [Burke  and  Poggio 
1981],  and  as  such  models  the  tip  region  as  a  'ring  of  current'.  MMP  on  the  other  hand,  is  more 
versatile,  capable  of  modelling  the  implant  as  a  filamentary  wire  similar  to  AMPC,  or  as  an 
explicit  cylinder  while  modelling  the  tip  as  flat  or  pointy.  The  results  shown  here  represent  part 
of  the  validation  for  a  much  more  extensive  research  effort  to  obtain  RF  energy  absorption 
estimates  for  a  range  of  different  length  implants  at  various  locations  in  the  body,  eg,  knee. 

Results 

Figure  3  shows  the  AMPC  calculated  current  magnitude  along  the  30  mm  wire  implant 
versus  frequency.  It  is  seen  that  maximum  current  occurs  at  approximately  433  MHz,  ?„T/3. 
Fig.  4  shows  the  AMPC  calculated  volume  averaged  squared  magnitude  of  the  total  electric  field 
near  the  implant  tip  at  the  peak  frequency  433  MHz  as  a  function  of  volume  surrounding  the  tip. 
This  result  shows  that  surrounding  the  tip  there  is  a  'region  of  influence',  in  this  case  around  70 
cm3.,  a  cube  of  around  4.1  mm,  or  a  sphere  of  radius  2.6  mm  Outside  this  region,  no  significant 
effects  occur  due  to  the  wire.  Fig.  5  shows  the  MMP  calculated  variation  in  the  enhancement  of 
the  1  gram  averaged  SAR  versus  frequency  due  to  different  wire  implant  tips.  As  for  the 
induced  current,  peak  SAR  occurs  at  ~A,T/3. 

Discussion 

The  RF  induced  currents  on  conducting  objects  in  dissipative  media  are  dependent  on  the 
type  of  medium  in  which  the  interaction  takes  place  with  resonance  occurring  at  a  lower 
frequency  than  in  free-space.  We  propose  that  this  is  because  currents  in  highly  dissipative 
media  tend  to  be  concentrated  near  the  wire  centre  and  are  attenuated  at  points  near  the  tips 
away  from  the  centre.  This  is  a  direct  consequence  of  the  dissipative  medium  acting  on  the  wire 
current  as  it  forms  along  the  wire  length  away  from  the  peak  at  the  wire  centre.  This  parallels 
the  findings  of  King  and  Harrison  who  studied  currents  on  wire  antennas  (radiators)  and  the 
electric  fields  near  them  [1969],  Thus  for  the  wire  implant  examined  in  this  study,  it  is  resonant 
at  a  physical  length  of  approximately  >,x/3.  It  is  interesting  to  note  the  effect  of  the  shape  of  the 
implant  tip  on  the  localised  SAR  peak,  with  the  pointed  tip  showing  the  least  enhancement.  The 
difference  between  the  pointed  and  flat  tip  is  probably  due  to  greater  attenuation  between  the 
field  at  the  tip  and  the  nodes  of  the  averaging  volume.  Also  the  SAR  enhancement  is  not  very 
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significant  for  this  implant  at  lengths  longer  than  XT/2  using  either  the  pointed  or  flat  tips. 
Further  investigations  are  needed  to  understand  the  reason  for  the  difference  between  the  pointed 
and  flat  tips  relative  to  the  AMPC  type  tip. 


length  along  scatterer  (mm) 


Fig.  3  AMPC  calculated  current  along  the  30  mm  long  wire  implant  for  various  frequencies  of  the 
incident  EM  field.  The  magnitude  of  the  incident  E-field  is  1  V/m  and  its  polarization  relative  to  the 
implant  is  shown  in  Fig.  2. 
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Fig.  4  Volume-averaged  total  electric  field  magnitude  and  its  square  near  the  tip  of  30  mm  long 
wire  implant  at  433  MHz  calculated  by  AMPC  using  a  spherical  averaging  volume.  The  field  is 
normalised  wrt  the  peak  value  at  the  tip.  The  incident  EM  field  is  shown  in  Fig.  2. 
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Fig.  5:  1  gram  averaged  SAR  for  implant  in  muscle  for  various  tip  shapes.  SAR  is  calculated 
relative  to  an  incident  E_field  of  1 V  m-1  at  the  implant  site  in  the  absence  of  the  wire.  Implant 
length  =  30  mm,  diameter  =  4  mm.  MMP  used  a  cubed  averaging  volume,  and  AMPC  a 
spherical  averaging  volume  (see  Fig.  2). 


Conclusions 

In  this  report,  two  computational  electromagnetics  codes  were  used  to  analyse  the  currents 
on,  and  fields  near,  the  tips  of  a  wire  implant.  Two  codes,  the  thin  wire  code  AMPC,  and  the 
more  versatile  electromagnetics  code  MMP  which  can  model  either  volume,  surface,  and/or 
filamentary  segments,  were  used  to  analyse  a  30  mm  long,  2  mm  radius  wire  implant. 
Resonance  of  the  currents  and  fields  was  found  to  occur  at  about  'kjl'i  (433  MHz).  This 
lowering  of  the  resonant  frequency  of  wire  sactterers  is  in  agreement  with  theoretical  predictions 
for  wire  antennas  in  a  highly  dissipative  medium  [King  and  Harrison  1969).  Also  the  localised 
mass  averaged  SAR  enhancement  is  less  due  to  a  pointed  tip  relative  to  a  flat  tip  as  a  result  of 
the  attenuation  of  the  surrounding  medium. 
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Towards  computational  methods  for  studying  cellular  effects  due  to  EM  fields. 

Author:  A.H.J.  Fleming 
Monash  University,  Clayton,  Australia,  3168 

1.  Introduction 

In  this  paper,  it  is  suggested  that  computations  based  on  multi-particle  methods  be  developed  to 
model  cellular  effects  due  to  static  and  time-varying  electromagnetic  (EM)  exposures.  Plasma  physicists  have 
for  some  time  successfully  modelled  complex  electromagnetic  interactions  by  using  multi-particle  methods 
[Dawson  1983],  Electric  (E-)  and  magnetic  (B-)  fields  can  originate  from  complex  biological  processes  in 
which  case  they  are  often  called  biogenic  fields,  or  due  to  solar,  lunar,  and  planetary-  origin,  and  also  due  to 
modem  technological  advances.  In  biological  tissues  there  are  a  variety  of  charged  and  inducibly  dipolar 
particles  such  as  electrolytes  and  complex  proteins  that  can  respond  to  electric  and  magnetic  stimuli. 
Computational  methods  can  be  an  important  tool  for  the  biologist,  capable  of  providing  comparison  with 
results  from  ‘wet  studies’,  or  as  a  ‘what  if  tool  to  investigate  hypotheses.  Experimental  methods  can  be 
costly  and  difficult  to  set  up  for  cellular  investigations,  and  in  this  situation,  computational  methods  can  yield 
useful  information  prior  to  experimentation.  Computational  methods  can  also  be  used  where  it  is  suspected 
that  weak  effects  are  present  and  where  epidemiological  inference  is  flawed  without  large  population  bases 
upon  which  to  draw.  This  paper  points  to  several  important  situations  where  computational  techniques 
might  be  used  to  assist  the  study  of  cellular  effects  due  to  a  variety  of  EM  fields. 

Bioeffects  due  to  static  and  time-varying  EM  fields  depend  upon  the  biological  microstructure  and 
may  thus  require  to  be  studied  using  microdosimetric  techniques  [Tenforde  1992].  It  is  no  surprise  that 
there  are  effects  due  to  biological  microstructure  that  are  not  measurabie  using  macroscopic  methods.  Fig.  1 
shows  a  schematic  of  a  human  cell.  New  methods  of  electron  microscopy  are  being  developed  and  used  to 
uncover  previously  unrevealed  cel!  structure  of  exquisite  complexity'.  Penman  [1995]  discussed  how  these 
new  images  are  demanding  a  rethink  of  our  previous  ideas  of  cytoskeleton,  nuclear  matrix,  mitosis,  and  the 
relation  of  membranes  to  cytostructure.  Powerful  new  fluorescent  markers  are  also  being  used  to  determine 
cellular  concentrations  of  ions  [Grynldewicz  et  al.  1985]  [Wolke  et  al.  1994],  Motions  of  cell  components 
are  also  being  revealed  using  these  methods  [Sowers  and  Hackenbrock  1981]  [White  et  al.  1990],  In  line 
with  these  advances,  mathematical  models  of  biological  and  cellular  interactions  with  electric  and  magnetic 
fields  based  on  simple  geometries,  eg  cells  and  tissues  represented  by  isolated  layered  spheres  and  ellipsoids, 
also  need  reworking. 


Fig.  1  Schematic  showing  human  cell  microstructure:  (a)  lysosome,  (b)  glycogen,  (c)  ribosomes,  (d) 
mitochondrion,  (e)  golgi  apparatus,  (f)  endoplasmic  reticulum,  (g)  nucleus,  (h)  nucleolus 
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2.  Towards  development  of  methods  of  computational  cell  biology 

Life  in  the  industrialized  world  gives  modem  man  devices  and  opportunities  that  were 
unimaginable  a  century  ago.  However,  we  are  now  bathed  in  a  variety  of  E-  and  B-fields  that  are  often 
much  higher  than  the  field  levels  that  existed  before  modem  times.  The  local  geomagnetic  flux  density  is 
around  1  x  10‘4  T  (1  Gauss)  depending  upon  position  on  the  globe.  Some  medical  devices,  eg.  MRI, 
involve  exposure  to  fields  above  1  T.  Environmental  E-fie!d  levels  vary  between  150  V  m"*  in  fair  weather, 
to  10  kV  m'l  for  brief  periods  during  thunderstorms.  Power  transmission  lines  may  cause  long  term 
exposures  exceeding  10  kV  m--.  Little  wonder  that  reports  of  cellular  bioeffects  due  to  weak  exposure 
levels  attract  concern  among  the  public  and  the  scientific  community.  While  such  reports  have  not  been 
widely  accepted  amongst  the  bioeffects  community,  they  have  caused  a  flurry'  of  research  into  cellular  and 
other  microbiological  effects. 


In  a  recent  review  of  E-  and  B-field  bioeffects,  Carpenter  and  Ayrapetyan  [1994]  detailed  the  need 
for  research  into  the  influence  upon  biological  cells  of  both  static  and  time-varying  fields.  Matters  of 
concern  and  also  opportunity  in  the  areas  of  biohazards,  biomedicine,  and  other  bioelectromagnetic  areas 
such  as  developmental  and  plant  cell  biology,  need  resolution.  Glimpses  of  the  possibilities  east,  such  as  the 
use  of  pulsed  magnetic  fields  (PMFs)  in  bone  refracture  therapy  [Bassett  1989],  high  intensity  PMPs  as  a 
prophylactic  against  the  often  fatal  damage  caused  by  ischaemic  attacks  (blockage  of  arteries)  [Grant  et  al. 
1994],  and  to  mitigate  against  pain  [Ellis  1994].  While  there  are  no  known  mechanisms  for  a  wide  range  of 
experimental  B-field  observations  [WHO  1987],  understanding  of  E-field  effects  is  emerging. 

Over  the  last  decade,  the  body  of  knowledge  of  the  influence  of  endogenous  and  exogenous  E- 
fields  upon  cells  has  been  expanding  rapidly.  While  weak  effects  remain  the  subject  of  debate,  a  range  of 
medium  to  strong  effects  is  now  at  least  partially  understood.  The  range  of  effects  observed  on  single  cells 
due  to  E-fields  includes  electrophoresis,  the  translation  of  whole  cells  in  the  direction  of  the  applied  field, 
dielectrophoresus,  the  movement  of  cells  in  a  dielectric  medium  associated  with  time-varying  fields, 
electrorotation,  the  rotation  (alignment)  of  cells  in  polarized  fields,  membrane  electrophoresis,  the  diffusion 
of  components  within  the  plasma  membrane,  and  electrodefcrmation,  the  distortion  of  the  shape  of  the  cell 
due  to  membrane  stresses. 


In  many  of  these  E-field  effects,  a  quantitative  understanding  has  emerged  and  theory  can  be  tested 
against  computation  and  observation.  However  no  complete  picture  has  emerged  of  the  way  in  which  E- 
fields  operate  to  affect  cell  function.  While  'snap-shots'  of  theories  exist,  numerical  methods  based  on 
three-dimensional  cell  structure  might  provide  a  more  meaningful  'moving  picture'  of  these  effects.  A  model 
which  can  calculate  in  the  primary  unit  of  cell  effect,  namely  any  changes  in  charged  particle  flux  flowing  in 
and  around  cells,  and  then  obtain  any  flux  dependent  effects,  would  be  most  instructive.  As  distinct  from 
one-dimensional  (circuit  analysis  type)  calculations  which  offer  only  an  'on-off  answer,  there  may  be  subtle 
effects  revealed  by  detailed  three-dimensional  numerical  analysis  of  a  process  involving  many  different 
interacting  particles  that  evolves  over  time.  Simple  three-dimensional  models  of  cells  of  this  kind  have 
begun  to  emerge.  Sala  and  Hemandez-Cruz  [1990]  examined  the  intracellular  calcium  inside  a  spherical 
model  of  a  neuron  due  to  buffers  and  tracers.  No  complete  model  that  includes  the  membrane  is  available; 
certainly  nothing  to  assist  study  of  E-field  effects. 


3.  Exposure  of  Celis  to  Electric  Fields. 


There  are  several  interrelated  research  areas  concerned  with  electrical  exposure  of  cells.  In  this 
section,  we  survey  three  reported  non-destructive  bioeffects  with  an  eye  towards  the  eventual 
implementation  of  a  computational  approach  towards  cell  biology.  There  is  a  natural  progression  through 
the  three,  from  direct  electrical  contact,  through  medium  to  strong  non-contact  E-field  exposure,  to  weak  E- 
field  exposure.  Only  the  first  is  accepted  without  question  since  it  can  be  tested  theoretically  against  a 
number  of  observed  effects.  Mechanisms  underlying  the  other  two  are  unclear  at  present.  Not  until  there  is 
validated  understanding  will  these  bioeffects  be  accepted  also  as  resulting  from  E-field  exposure  without 
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question  by  the  bioeffects  community.  This  is  perhaps  where  numerical  calculations  can  be  most  useful,  as 
an  aid  to  theoretical  understanding. 

It  is  worth  pointing  out  that  strong  bioeffects  tend  to  be  able  to  be  modelled  using  lumped  circuit 
analysis  techniques.  As  the  bioeffect  becomes  more  subtle,  circuit  concepts  become  less  adequate  as  the 
particle  diffusion  becomes  more  disperse.  Methods  less  filamentary,  and  more  suited  to  volumetric  analysis 
are  needed,  such  as  finite  difference  and  finite  element  methods  by  which  the  overall  inhomogeneous 
problem  domain  is  subdivided  into  a  series  of  smaller  homogeneous  entities,  and  where  time  is  similarly 
subdivided  into  small  time  increments. 

3.1  Direct  Contact  Effects:  Membrane  Voltage  Patch  Clamp. 


Some  cells  act  toeether  to  form  physiological  systems  such  as  the  heart,  nerve  fibres,  muscles,  and 
neuromuscular  junctions.  When  the  membrane  of  these  cells  are  electrically  •fired',  ions  are  released  across 
the  membrane.  This  occurs  when  a  certain  membrane  potential  has  been  reached.  Commonly  the  cells  are 
then  able  to  propagate  an  ionic  signal  to  their  neighbours  which  are  physically  adjacent  and  a  macroscopic 
effect  follows.  Electrophysiology  is  the  study  of  such  cellular  systems  that  oscillate  between  resting  and 
active  states  and  are  controlled  by  voltages  applied  across  their  membranes. 

The  study  of  membrane  biophysics  owes  much  to  the  pioneering  work  of  Hodgkin  and  Huxley 
beginning  in  the  early  1950’s  [Plonsey  and  Barr  1988].  Using  voltage  patch  clamp  electrodes,  they 
investigated  membrane  ionic  currents  and  their  dependence  on  transmembrane  potential.  Starting  with  a 
resting membrane,  in  dynamic  equilibrium,  they  increased  the  transmembrane  potential  until  the  cell  became 
active* deriving  equations  relating  the  time-varying  ionic  currents  and  the  membrane  potentials.  Their  work 
has  lead  to  a  deepening  quantitative  understanding  of  transmembrane  dynamics  [Chay  and  Keizer  1983] 
[Chay  and  Rinzel  1985].  The  stochastic  gating  mechanism  of  individual  ion  channels  is  now  understood  so 
that  this  near  molecular  analysis  ties  in  with  effects  at  the  cellular  level  [Plonsey  and  Barr  1988],  This  direct 
contact  effect  has  in  turn  lead  to  more  recent  experiments  trying  to  elucidate  the  corresponding 
transmembrane  ion  dynamics  when  cells  are  exposed  to  external  E-fields  [Ehrenberg  et  al.  1987], 

3.2  Biogenic  E-field  Effects1. 

Growth  and  development  of  cells  is  thought  to  be  partly  controlled  by  bioelectric  fields.  While  the 
strong  effects  of  electrophysiology  are  theoretically  and  experimentally  indisputable,  the  weaker  effects  of 
biogenic  E-fields  are  yet  to  be  fully  understood  although  recent  efforts  are  progressing  our  understanding. 
Some  effects  may  well  depend  upon  the  flow  of  ionic  currents  near  cells  [Jaffe  1979]  and  upon  protein 
diffusion  within  plasma  membranes  [Poo  1981}. 

In  a  recent  study,  spherical  Mougeotia  protoplasts,  originally  cylindrical,  were  exposed  to  E-fields 
of  20  V  m_1  [White  et  al.  1990].  Over  some  hours,  the  protoplast  walls  regenerated  re-establishing  the 
original  shape.  How  regrowth  occurs  is  unclear;  but  one  possibility  concerns  electrophoresis  of  membrane 
proteins  [Poo  1981], 

Effects  of  exposure  to  static  E-fields  of  1000  V  m"1  upon  cytoskeleton  were  noted  in  experiments 
designed  to  observe  and  quantify  galvanotaxis  of  fibroblasts  [Hui  1994],  Changes  in  intracellular  calcium 
flux  were  measured  using  aequorin,  a  calcium  chemiluminescent  indicator  protein,  and  fiuo-3,  a  calcium- 
sensitive  fluorescent  dye.  Motility  of  the  fibroblasts  was  measured,  fluorescent  micrographs  showing 


'There  is  a  rich  history'  of  research  into  bioelectric  fields  and  cells  going  going  back  at  least  to  the  discovery 
of  the  cell  as  a  fundamental  unit  of  biology  by  Schwann  and  Schleiden  in  1847  [Lund  1947].  It  is  worth 
noting  that  since  Mesmer  in  the  1770's  to  our  own  time,  those  who  attempt  to  claim  biological  effects  due  to 
magnetism,  ('animal  magnetism'),  have  often  been  denounced  as  quacks  and  frauds,  m  total  contrast  with  the 
popular  success  of  experiments  by  Galvanni  and  Volta  to  demonstrate  'animal  electricity'  [Franke!  1986], 
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changes  in  actin-containing  stress  fibre  patterns  of  the  cytoskeieton  that  span  across  the  cell  body  and  are 
attached  via  proteins  to  the  inner  plasma  membrane. 


Applied  E-Field 


(a)  (b) 

Fig.  2  Development  and  growth  in  cells:  (a)  Asymmetric  growth  tip  of  seaweed  Fucus  embryo 
(b)  Symmetric  regrowth  of  Mougeotia  protoplast.  Symmetry  is  defined  wrt  left-right  shape. 


Fig.  3.  Ionic  currents  flowing  through  Mougeotia  protoplast.  Proteins  imbedded  within  the 
membrane  adapt  their  positions  by  diffusion  across  the  surface  when  exposed  to  electric  fields 
(electrophoresis).  These  proteins  presumably  act  as  ion  channels  enabling  currents  to  flow  into  and 
through  the  cell  interior.  Cylindrical  growth  is  intrinsically  symmetric,  hence  proteins  presumably 
move  to  either  end  depending  upon  polarity  to  enable  current  flow. 


3.3  Weak  E-field  Exposure  Effects. 


Over  the  past  two  decades,  a  number  of  reports  have  emerged  of  weak  extremely  low  frequency 
(ELF)  and  ELF  modulated  radiofrequency  (RF)  exposures  inducing  measurable  ionic  changes  in  cells. 
Windowed  effects  at  power  densities  less  than  1  mW  cirf“  across  both  ELF  and  RF  frequencies  have  been 
reported  [Adey  1980}  [Adey  and  Sheppard  1987]  [Bawin  et  al.  1989].  In  the  earliest  experiments,  chick 
brain  hemispheres  in  saline  solution  inside  a  test  tube  of  about  1.4  cm  diameter  were  exposed  to  ELF  and 
ELF  modulated  RF  fields  in  air  (see  Fig.  4a).  Extracellular  tissue  fields  at  ELF  are  estimated  to  be 
10'6V  m"*  (see  Fig.  4b),  and  the  corresponding  RF  levels  are  around  10  V  m'l. 

These  experiments  reportedly  result  in  frequency  dependent  changes  in  intra  and  extracellular 
calcium  ion  (efflux/influx)  concentration  levels  both  at  ELF  and  modulated  ELF  frequencies  [Bawin  et  al. 
1975]  [Bawin  and  Adey  1976]  [Liboff  et  al.  1987],  The  original  observations  in  which  in-vitro  specimens 
were  exposed  in  air  remain  unsubstantiated  as  replication  experiments  have  sometimes  failed  to  support  the 
primary  findings  [Schwartz  et  al.  1990]  [Wood  et  al.  1993]  raising  suggestions  that  there  is  no  actual  effect, 
only  experimental  artefact;  perhaps  some  other  parameter  apart  from  ELF  exposure,  eg  temperature,  is 
involved.  Since  the  observations  are  not  understood  mechanistically,  there  is  question  within  the  bioeffects 
community  about  the  validity  of  these  effects  at  weak  exposure  levels.  Further  controversy  exists  as  the 
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modulated  effects,  if  valid,  appear  at  levels  below  those  set  down  by  national  and  international  RF  exposure 
standards  [IRPA  1988]  [SAA  1990]  [ANSI  1991],  while  the  ELF  effects  occur  at  much  weaker  levels^  In  aU 
this  it  is  not  at  all  clear  that  any  actual  frequency  dependence  constitutes  a  health  hazard,  apart  from  being 
an  interesting  bioeffect.  Public  concern  nevertheless  demands  further  studies  that  will  hopefully  le^  to  a 
clear  understanding. 


Membrane 


Region  A:  50 
Region  B:  1 
Region  C:  3  mV/mN 


(*)  (b) 

Fig.  4  (a)  Schematic  of  in-vitro  ELF  exposure  set-up  (E~50  Vm’  > 

(b)  Fields  inside  extracellular  medium  and  cell  membrane  due  to  50  V  m*1  E-field  in  air 


4.  Conclusion 

At  this  point  in  time,  there  is  a  need  for  computational  methods  to  be  developed  to  provide  tools  to 
assist  the  theory  and  understanding  of  microbiological  processes,  and  then  to  provide  substantiation  or 
otherwise  for  any  cellular  bioeffects. 

Muiti-paitide  methods  seem  appropriate  as  collective  behaviour  of  charged  particle^  eg.  io«  aod 
proteins,  and  polarizable  particles,  in  particular  cells,  can  be  important  m  btotogy.  Stmple  muht-cell  metM 
Ling  continuum  (finite  difference)  equations,  along  tsith  a  simple  but  speciahzed  grid  m«pa>Ug  (finite) 
periodic  boundary  conditions  might  permit  feasible  computations.  Using  appropriate  spec.al  purpose  bu 
relatively  simple  multi-particle  methods,  cellular  effects  may  be  studied. 

(1)  By  treating  cellular  tissues  as  suspended  cells  in  an  extracellular  fluid,  much  as  Maxwell  did,  but 
using  numerical  techniques  based  on  multi-particle  methods,  it  may  be  possible  to develop 

models  that  reflect  fields  in  tissues  that  behave  as  expected  and  as  measured  poster  and  Schwan  1986] 
[Pethig  1988]  Such  models  would  involve  ions  diffusing  within  the  extracellular  and  intracellular  media. 
Lid  proteins  diffusing  within  the  plasma  membrane,  and  would  involve  ion  transport  across  ^ 
surfaces  By  allowing  both  the  extracellular  fluid  and  the  charged  particles  on  and  within  the  cells  and  the 
SlkSernSLes  the  freedom  to  move  as  a  function  of  time,  models  of  fluids  may  also  be  modelled  and 
tested  against  measured  data.  Polarization  of  individual  cells,  and  cell-cell  effects,  such  as  dipolar  rotation 
and  alignment,  can  be  studied,  and  theory  tested  and  improved. 

(2)  Biological  processes  such  as  growth  and  development  of  cells  where  biogenic  E-fieids  processes 
are  thought  to  be  involved  could  be  modelled.  The  time-evolutionary  way  in  which  biogenic  fields  are 
established  in  multi-cell  tissues  might  be  studied. 


648 


(3)  Effects  of  chemical  agents  might  be  modeiied  as  volumes  of  electrolytic  fluids  in  which 
concentration  gradients  cause  flows  of  ionic  currents.  Similarly,  application  of  externally  applied  ELF  or  RF 
exposures,  or  development  of  biogenic  E-fields  would  involve  flows  of  the  charged  and  dipolar  particles. 

(4)  Bioeffects  due  to  ELF  or  RF  exposures  might  be  studied  using  a  two-tiered  approach  involving 
two  levels  of  models,  one  overall  macroscopic  model  and  one  or  more  microscopic  model.  The  E-  and  B- 
field  exposures  in  the  microscopic  studies  could  be  estimated  using  macroscopic  calculations;  realistic 
models  (eg  using  MRI  scans)  of  the  whole  biological  entity  can  give  detailed  exposure  levels  within  different 
macroscopic  tissues.  Knowledge  of  the  level  of  exposure  within  each  microscopic  model  would  come  from 
this  overall  model.  Thus,  the  overall  problem  might  be  solved  using  a  two-tiered  numerical  approach,  a 
macroscopic  calculation  combined  with  ensuing  microscopic  calculations  to  determine  cellular  effects.  Use 
of  finite  periodic  boundary  conditions  could  allow  the  macroscopic  fields  to  be  incorporated  as  a  variable 
far-field  in  the  microscopic  model.  This  would  allow  the  inhomogeneity  of  the  macroscopic  fields  to  take 
effect  at  the  cellular  level  within  the  microscopic  models. 
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Abstract 

The  Specific  Absorption  Rate  (SAR)  in  the  head  of  a  mobile  phone  operator  is  investigated  using 
both  numerical  and  experimental  dosimetry1 .  A  box  and  a  sphere  phantom  (test  phantoms)  as  well  as 
a  female  phantom  are  used,  with  a  dipole  antenna  and  a  mobile  phone  as  RF  sources.  The  Method 
of  Moments  (surface  equivalence  principle)  is  employed  for  the  simulations  with  geometrically  accurate 
numerical  models  representing  the  human  phantom  and  antennas.  For  the  measurements,  fibre-glass 
phantom  casings  are  filled  with  liquid  based  equivalent  biological  material.  Near  field  and  SAR  results 
obtained  from  measurements  and  simulations  are  compared. 


1  Introduction 

The  popularization  of  mobile  phone  communication  through  the  cellular  phone  industry  in  the  past  few  years  has, 
among  other  things,  increased  awareness  of  the  possible  health  risk  involved  in  using  a  mobile  phone.  In  the  past 
five  years  various  international  research  groups  and  institutions  have  been  involved  in  research  activities  related 
to  the  possible  hazardous  effect  of  mobile  phones  on  human  operators.  The  nature  of  the  problem  is  such  that 
the  activities  can  be  sub-divided  into  three  main  fields  of  research:  1)  Dosimetry  research:  This  field  of  research 
can  generally  be  described  as  the  calculation,  deduction  or  measurement  of  the  electromagnetic  fields  penetrating 
into  the  head  of  a  human  operator  or  the  power  dissipation  associated  with  the  penetrating  fields;  2)  Biological 
research:  This  involves  studies  performed  on  biological  tissues  or  animals  when  exposed  to  non-ionizing  radiation 
at  cellular  phone  frequencies;  3)  Epidemiological  research:  Epidemiological  studies  involve  the  statistical  evaluation 
of  the  association  between  disease  (or  illness)  and  exposure  to  electromagnetic  fields  at  mobile  phone  frequencies. 

An  excellent  summary  of  biological  and  epidemiological  research  and  results  can  be  found  in  reference  [1].  The 
work  presented  in  this  paper  falls  under  the  dosimetry  research  category.  The  energy  absorption  in  the  head 
of  a  mobile  operator  is  investigated  using  both  experimental  (measurements)  and  numerical  dosimetry.  The 
investigation  is  limited  to  dosimetry  involving  homogeneous  phantoms  (equivalent  muscle  tissue  only)  in  free- 
space.  SAR  results  for  inhomogeneous  phantoms  (muscle,  brain  and  bone  tissue)  in  free-space  as  well  as  in  a  motor 
vehicle  can  be  found  in  reference  [2]. 

Section  2  of  this  paper  describes  the  establishment  of  the  measurement  setups  as  well  as  the  general  preparations 
for  performing  experimental  dosimetry.  Equivalent  biological  materials  were  prepared  using  recipes  found  in  the 
literature  and  the  properties  of  the  materials  (permitivity  and  conductivity)  were  measured  for  validation  purposes. 
Fibre-glass  based  phantoms  were  filled  with  equivalent  tissues  which  were  prepared  in  a  liquid  based  (gel-like)  form. 
A  box  and  a  sphere  phantom  were  used  as  test  phantoms  with  a  dipole  antenna  with  balun  as  RF  source.  Field 
measurements  were  performed  inside  the  equivalent  tissue  material  in  the  test  phantoms.  From  this  the  SAR 
(Specific  Absorption  Rate)  can  be  extracted.  A  female  phantom  filled  with  muscle  tissue  was  used,  with  a  dipole 
antenna  and  a  mobile  phone  (with  extended  and  short  antenna  arms)  as  RF  sources. 

Section  3  presents  the  technical  aspects  of  numerical  dosimetry  and  considers  some  results  obtained  as  well  as 
a  technical  interpretation  thereof.  Geometrically  accurate  numerical  models  of  the  female  phantom  as  well  as  the 

1The  word  dosimetry  has  been  derived  from  the  dosimetric  concept  (where  the  ‘dose’  is  defined  as  the  energy  per  unit  mass)  used 
by  international  regulating  bodies  for  protection  from  non-ionizing  radiation. 
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dipole  and  mobile  phone  were  developed  and  used  with  the  MoM  simulations.  Method  of  Moments  (MoM)  results 
for  the  test  phantoms  as  well  as  the  female  phantom  are  compared  to  measurements. 

The  emphasis  of  this  paper  is  on  the  comparison  between  measured  and  calculated  field  values  (and  related  SAR 
values)  inside  the  phantoms.  The  methodology  employed  with  measurements  and  numerical  simulations  will  also 
be  considered.  Due  to  the  sensitive  nature  of  the  topic,  the  absolute  SAR  values  obtained  will  not  be  presented 
here. 


2  Experimental  Dosimetry 

2.1  Phantom  development 

Three  types  of  equivalent  biological  tissue  materials  were  prepared  namely  muscle,  brain  and  bone  tissue  (see 
reference  [2]).  The  work  presented  in  this  paper  concerns  measurements  and  simulations  using  only  muscle  tissue 
(homogeneous  phantoms). 

A  coaxial  measurement  system  in  conjunction  with  a  Baker-Jarvis  extraction  algorithm  [3]  was  used  to  measure 
the  electrical  properties  of  all  the  synthetic  tissue  equivalents.  The  measured  material  parameters  of  the  equivalent 
biological  tissue  are  given  in  table  1.  These  values  compare  reasonably  well  with  those  published  in  the  literature  [4] 
(a  5-10%  difference  for  muscle  and  brain  tissue).  The  only  exception  is  the  conductivity  of  the  equivalent  bone 
tissue  (see  reference  [2]  for  more  detail). 


Table  1:  Material  parameters  of  simulated  human  tissue  at  900MHz. 


Tissue 

Type 

Measured 
er  a  [S/m] 

From  Literature 
er  a  [S/m] 

Muscle 

51.75 

1.25 

54.7 

1.38 

Brain 

38.25 

1.10 

41.2 

1.22 

Bone 

7.89 

0.072 

7.4 

0.16 

Fibre-glass  phantom  casings  were  manufactured  and  used  as  containers  of  the  equivalent  tissue  (prepared  in  liquid 
form)  when  measurements  were  performed.  The  rectangular  shaped  box  phantom  (31cm  x  23cm  x  10cm)  and  the 
spherical  phantom  (r=9.4cm)  are  test  phantoms  used  for  preparation  (setting  up  the  measurement  procedures) 
and  validation  purposes.  The  third  and  fourth  casings  are  fibre-glass  shells  of  a  female  (see  figure  2)  and  a  male 
human  being.  The  female  and  male  phantoms  were  constructed  using  a  mannikin.  These  human  phantoms  were 
filled  with  equivalent  muscle  tissue  material  when  measurements  were  performed.  Only  the  results  for  the  box, 
sphere  and  female  phantoms  will  be  considered  in  this  paper. 

2.2  Measurements  at  the  US  Setup 

An  experimental  dosimetry  setup  in  the  Anechoic  chamber  at  the  Department  of  Electrical  and  Electronic  engi¬ 
neering,  University  of  Stellenbosch  (the  US  setup)  has  been  established.  A  network  analyzer  was  used  to  perform 
measurements  on  a  two-port  system. 

The  probe  used  is  an  electrically  small  dipole  antenna.  The  probe  consists  of  a  Marchand  balun  with  two 
semirigid  coaxial  waveguides  extruding  one-and-a-quaxter  wavelengths  out  of  the  balun.  This  ensures  a  balanced 
dipole  probe  with  reasonable  matching  properties.  The  probe  movement  is  controlled  by  the  automated  x-y 
positioning  system,  and  a  semi-automated  third  axis  (z-axis)  positioning  arm. 

As  the  first  “test”  source  antenna,  another  dipole  was  used  (see  figure  1),  also  fed  with  a  Marchand  balun.  This 
antenna  is  resonant  at  900MHz.  A  mobile  phone,  with  extended  and  short  antenna  arm,  was  converted  so  that 
it  could  be  used  as  a  source  antenna  for  the  human  phantom  measurements.  A  thin  50-ohm  semi-rigid  cable  was 
guided  inside  the  phone  from  the  feed-point  of  the  antenna  to  a  suitable  area  for  a  connector  to  be  attached.  Su 
(with  antennas  in  position  next  to  phantom  head),  radiation  patterns  and  gain  of  the  dipole  and  the  mobile  phone 
were  measured.  Su  together  with  the  internal  losses,  determines  the  actual  power  transmitted  by  the  antennas. 
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In  the  measurement  setup,  the  transmitting  antenna  is  connected  to  port  1  of  the  network  analyser  and  the  probe 
to  port  2.  The  measured  data  obtained  from  the  network  analyser  is  821-  Using  S21  and  the  measured  antenna 
characteristics,  the  electric  field  components  at  the  measurement  position  can  be  computed  [2]  for  a  specified  input 
power.  A  SAR  value  can  then  be  extracted  from  this  vector  E-field  using  the  following  equation  [1]: 


SAR  = 


1  o\E? 

2  P 


(1) 


with  ex  the  conductivity  (in  S/m)  and  p  the  density  (in  kg/m3)  of  the  equivalent  biological  material.  Both  a 
(see  table  1)  and  p  (1.30  g/cm3)  were  measured  for  the  equivalent  muscle  tissue. 

Measurements  were  performed  using  the  sphere,  box  and  female  phantoms  at  the  US  Setup.  For  the  box  phantom 
the  dipole  antenna  was  positioned  with  dipole  arms  3.5cm  below  the  box  with  the  dipole  feed  point  (voltage  gap) 
aligned  with  the  centre  of  the  box.  The  dipole  arms  were  parallel  to  the  longest  sides  (31cm  sides)  of  the  box.  The 
probe  penetrated  into  the  box  from  the  top.  Measured  results  along  a  line  across  the  box,  1cm  from  the  bottom, 
are  presented  in  figure  4. 

A  3cm  long,  3mm  wide  slot  was  cut  out  of  the  sphere  phantom.  This  slot  was  used  to  fill  the  sphere  with  muscle 
tissue.  The  dipole  antenna  was  positioned  1cm  from  the  bottom  of  the  sphere.  The  dipole  probe  penetrated  into 
the  sphere  through  the  slot  (mentioned  above)  cut  out  of  the  “top”  of  the  sphere.  This  allows  for  movement  of 
the  probe  in  the  vertical  direction  only.  Measured  results  along  a  line  starting  at  the  bottom  of  the  sphere  and 
extending  along  a  centre-line  to  the  top  of  the  sphere  are  shown  in  figure  5. 

Figure  2  shows  the  female  phantom  measurement  at  the  US  setup.  In  order  to  measure  both  polarizations  of 
the  electric  field,  all  measurements  were  repeated  with  the  dipole  probe  aligned  in  both  the  x-  and  z-directions. 
Figures  6  and  7  show  the  SAR  values  along  a  line  penetrating  vertically  into  the  head  of  the  female  phantom.  The 
dipole  and  mobile  phone  with  extended  antenna  were  used  for  these  measurements. 


2.3  The  EMSS  Setup 

An  experimental  dosimetry  setup  has  been  established  at  the  offices  of  EM  Software  &  Systems.  The  differences 
between  the  US-Setup  and  the  EMSS-Setup  lie  mainly  in  the  methodology  of  probing  the  electric  field  inside  the 
phantoms.  Figure  3  shows  a  diagram  of  the  EMSS  measurements  setup. 

At  the  EMSS  setup  a  signal  generator  (900  MHz)  is  connected  to  a  13  Watt  amplifier.  The  output  of  the  amplifier 
is  connected  to  the  RF  source  antennas  (dipole  or  mobile  phones).  A  diode-based  probe  has  been  designed  and 
manufactured  for  this  project.  The  900MHz  electric  field  at  the  probe  tip  is  converted  to  a  DC-signal  by  the 
diode.  The  voltage  at  the  end  of  the  DC-lines  is  a  function  of  the  electric  field  at  the  probe  tip.  A  low  pass  filter 
reduces  any  unwanted  signals  received  along  the  data  acquisition  lines  which  might  be  picked  up  by  the  diode  and 
converted  to  a  DC-signal.  (The  detail  workings  of  such  a  probe  is  discussed  in  references  [5]). 

The  probe  developed  is  polarized  and  yields  only  one  of  the  three  spatial  components  of  the  electric  field.  A 
second  component  can  be  measured  by  rotating  the  probe  by  90  degrees  and  repeating  the  measurement.  The  third 
component  cannot  be  measured  with  the  current  probe.  The  probe  positioning  at  the  EMSS  facility  is  controlled 
by  a  three-axis  positioning  system  —  two  PC  controlled  and  one  manual  controlled  to  within  2mm  positioning 
accuracy. 

For  probe  calibration,  temperature  measurements  were  performed  in  the  box  phantom  filled  with  muscle  tissue 
(see  figure  4).  A  thermocouple  was  used  as  temperature  probe  with  the  same  data  acquisition  electronics  used  for 
the  diode-based  electric  field  probe.  More  detail  on  probe  calibration  can  be  found  in  references  [1,  pp.36]  and  [2]. 

SAR  values  can  be  obtained  from  the  electric  field  measurements  at  the  EMSS  setup  using  equation  1.  Mea¬ 
surements  were  performed  at  the  EMSS  setup  using  the  box  and  homogeneous  female  phantoms.  The  sensitivity 
of  the  probe  (or  lack  thereof)  and  the  DC-voltage  noise  level  (caused  by  various  devices  in  the  system)  were  the 
two  main  reasons  causing  unreliable  measured  results  in  regions  of  low  SAR  values.  It  was  thus  not  possible  to 
perform  accurate  measurements  in  the  sphere  phantom. 


3  Numerical  Dosimetry 

The  method  of  moments  (MoM)  [6]  as  implemented  in  the  computer  code  FEKO  [7]  is  the  numerical  technique 
used  for  the  simulations.  The  Finite  Element  Method  and  Finite  Different  Time  Domain  are  two  other  techniques 
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suitable  for  SAR  calculations.  Technical  information  on  the  MoM,  FEM  and  FDTD  techniques  as  SAR  prediction 
tools  can  be  found  in  reference  [1]. 

3.1  MoM  implementation  in  FEKO 

With  the  MoM  implementation  in  FEKO  metallic  surfaces  are  subdivided  into  triangular  patches  and  the  treatment 
is  similar  to  reference  [8],  with  some  minor  modifications.  Metallic  wires  and  the  connection  of  wires  with  surfaces 
are  also  supported.  These  features  were  used  for  modelling  the  dipole  and  mobile  phone  antennas. 

Several  possibilities  exist  in  order  to  consider  dielectric  bodies  within  the  framework  of  the  MoM:  Application 
of  the  volume  or  the  surface  equivalence  principles.  Both  techniques  are  available  in  FEKO,  but  for  this  paper 
the  surface  equivalence  principle  was  chosen  because  of  the  requirement  of  a  3-D  discretization  for  the  volume 
equivalence  principle,  the  memory  requirement  is  too  high  at  the  frequency  range  under  consideration.  The  surface 
equivalence  principle,  on  the  other  hand,  requires  only  a  2-D  discretization  of  the  scatterer’s  surface,  but  limits 
the  application  of  the  MoM  to  partly  homogeneous  bodies.  This  is  acceptable  for  the  current  application  to 
homogeneous  phantoms  filled  with  muscle  tissue. 

Similar  to  metallic  structures,  the  surface  of  the  dielectric  body  is  also  represented  by  triangular  patches  (see 
figure  4)  and  the  unknown  equivalent  surface  current  densities  of  electric  and  magnetic  type  are  expressed  by  the 
superposition  of  basis  functions: 


N  N 

=!>*/«  =  (2) 

n=l  n=l 

The  same  rooftop  basis  functions  /„  used  for  metallic  surfaces  are  also  applied  to  model  the  equivalent  electric 
current  density  J.  Depending  on  the  kind  of  integral  equation  (EFIE,  MFIE,  CFIE,  PMCHW,  Muller,  see  e.g. 
[9,  10]  for  more  details)  the  simple  selection  gn  =  fn  leads  to  singular  matrices  of  the  system  of  linear  equations. 
An  alternative  for  gn  was  proposed  in  [11].  However,  for  the  computation  of  peak  SAR  values  right  at  the  surface 
of  the  dielectric  body,  it  turned  out  that  these  basis  functions  ,  lead  to  some  oscillating  behavior  in^the  scattered 
electric  near-field  very  close  to  the  dielectric/air  interface.  Alternative  basis  functions  gn  =  h  x  fn  proposed  in 
[12,  13]  are  therefore  used  in  this  paper  in  conjunction  with  the  EFIE  (Galerkin  method). 

3.2  Models  of  the  source  antennas 

The  dipole  antenna  was  modelled  as  two  metallic  arms  with  a  voltage  gap.  The  inclusion  of  the  balun  casing  in 
the  MoM  model  was  essential.  The  voltage  gap  source  model  described  above  ensures  a  balanced  dipole  feed.  It 
was  thus  not  necessary  to  include  the  balun  network  into  the  models,  only  the  geometry  of  the  balun  casing. 

The  mobile  phone  antenna  was  modelled  using  two  flat  metallic  boxes,  representing  the  cellphone  base  and 
battery,  and  a  metallic  rod  representing  the  extended  or  short  antenna  arms.  A  voltage  gap  feed  segment  (connected 
on  the  one  end  to  the  cellphone  base  and  on  the  other  end  to  the  extended  arm)  was  used.  The  mobile  phone  models 
developed  for  the  MoM  simulations  are  reasonably  accurate  when  radiation  patterns  are  compared.  However,  there 
is  certainly  room  for  improvement. 

3.3  Phantom  models 

The  MoM  sphere  and  box  phantoms  were  created  using  FEKO’s  sphere  and  quadrangular  element  geometric  control 
cards.  The  surface  equivalence  principle  is  used  (see  section  3.1  above)  with  the  appropriate  material  parameters 
(er  and  a)  for  the  enclosed  dielectric  region.  The  MoM  requires  only  a  surface  mesh  which  was  discretised  as  fine 
as  0.1  Ar  maximum  triangular  element  side  length  (with  Ar  the  wavelength  inside  the  dielectric  region).  The  dipole 
antenna  was  used  with  the  sphere  and  box  phantoms  which  allowed  for  the  utilization  of  symmetry  in  the  FEKO 
models. 

Accurate  numerical  models  of  the  female  phantom  was  created  from  CAT  scans  of  the  phantoms  and  a  CAT-scan- 
to-Polygon-shaped  conversion  program  developed  for  this  project.  The  coordinates  of  the  polygons  representing 
the  various  cross-sections  of  the  head  (at  different  levels)  were  used  as  nodal  points  on  a  surface  mesh  consisting 
of  triangles.  The  triangles  were  written  to  a  STL  (Stereolithography)  format  for  inclusion  in  FEKO. 
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The  female  phantom  model  for  the  MoM  is  shown  in  figure  7.  This  model  consists  of  2294  triangular  elements 
(6882  unknowns)  on  the  surface  of  the  phantom  head.  This  is  very  close  to  the  largest  dielectric  body  (consisting 
of  muscle  tissue)  which  can  be  solved  at  900MHz  with  the  available  computer  hardware  . 

3.4  Computer  simulated  results 

In  this  section  we  will  consider  the  results  obtained  with  the  numerical  phantom  models  described  above.  The 
results  will  be  compared  to  measurements  at  the  two  experimental  dosimetry  setups.  Note  that  all  SAK  results 
have  been  normalized  such  that  the  maximum  value  on  each  figure  is  equal  to  1  [W/g].  The  comparison  between 
measurements  and  simulations  is  thus  presented  without  presenting  the  absolute  SAR  values. 

Results  obtained  with  the  box  phantom  are  compared  to  measurements  in  figure  4.  There  is  good  agreement  in 
general.  The  MoM  seems  to  slightly  over-predict  SAR  values  when  compared  to  both  sets  of  measurements.  The 
shape  of  the  SAR  predictions  from  the  temperature  measurements  is  clearly  not  as  smooth  as  would  be  expected. 
The  temperature  measurement  at  the  centre  point  of  the  line  was  used  for  calibrating  the  diode-based  probe. 

With  the  sphere  phantom  simulations  convergence  test  results  showed  that  the  MoM  models  were  discretised  fine 
enough  using  Ar/5  maximum  triangle  edge  length  on  the  surface  of  the  sphere.  The  agreement  between  measured 
and  MoM  simulated  results  is  excellent  (see  figure  5).  The  position  as  well  as  the  depth  of  the  nulls  is  almost 

ld  Fo^the  numerical  simulations  involving  the  homogeneous  female  phantoms,  material  parameters  for  equivalent 
muscle  tissue  (er  =  51.75  and  a  =  1.25  —  see  table  1)  were  used.  Figure  6  shows  SAR  results  on  a  vertical  line 
starting  close  to  the  antenna  and  penetrating  into  the  phantom  head.  This  is  for  a  homogeneous  female  phantom 

with  dipole  antenna.  , 

The  MoM  results  (complete  head)  agree  well  with  the  measurements  at  the  US  Setup  except  close  to  the  side 
of  the  head  furthest  from  the  antenna.  A  spatial  shift  is  observed  in  the  null.  The  reason  for  this  is  that  the 
complete  female  phantom  head  is  used  with  the  simulations  while  the  female  phantom  is  not  completely  filled  with 
equivalent  muscle  tissue  when  measurements  are  performed  (see  figure  2).  The  null  seems  to  arise  from  an  out  of 
phase  reflection  originating  at  the  material  discontinuity  where  the  muscle  tissue  ends,  and  the  air  starts  again 
The  null  would  certainly  be  expected  to  shift  if  the  position  of  the  material  discontinuity  shifts.  The  MoM  partial 
head  result  on  the  same  figure  shows  how  the  null  moves  close  to  the  measured  null  when  a  partial  female  phantom 

head  is  simulated.  ,  ,  ,  ,  ,  , 

Figure  7  shows  SAR  results  on  a  vertical  line  penetrating  into  the  phantom  head  of  a  homogeneous  female 
phantom  with  mobile  phone  antenna  (extended).  The  MoM  results  are  for  a  partial  phantom  head  The  MoM 
results  compare  well  with  the  measured  results  of  the  US  Setup.  The  EMSS  results  compare  well  with  the  US  Setup 
measurements  and  the  simulations  in  the  region  close  to  the  antenna.  No  EMSS  measurements  were  performed 
deeper  into  the  phantom  head  due  to  the  relatively  high  noise  level  of  the  diode  based  probe  at  the  EMSS  setup. 
The  discontinuity  in  the  EMSS  measurements  at  approximately  2.5cm  into  the  head  is  where  the  SAR  values  are 
below  the  current  noise  level. 


4  SAR  distribution  in  the  phantom  head 

The  numerical  simulations  can  be  used  to  calculate  the  SAR  values  on  a  3D  grid  inside  the  phantom  head.  This 
allows  one  to  compare  the  SAR  distribution  in  the  phantom  head  for  different  RF  source  antennas.  (This  would 
also  be  possible  for  measurements  if  a  3D  grid  of  measured  data  is  available.)  Figures  8  to  10  show  some  of  the 
SAR  distribution  results  in  the  head  of  the  human  phantoms  using  different  source  antennas.  It  is  clear  from  these 
and  other  results  not  presented  that  the  distributions  differ  for  the  different  source  antennas.  The  SAR  is  certainly 
distributed  more  smoothly  along  the  inside  of  the  head  with  the  extended  mobile  phone  antenna  than  with  the 

short  antenna.  _ 

2 A  512M  RAM,  200MHz,  dual  processor,  Pentium  Pro  PC 
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5  Conclusions 


Two  experimental  dosimetry  setups  have  been  established.  At  the  US  setup  measurements  were  performed  inside 
a  completely  shielded  anechoic  chamber  using  a  small  dipole  probe  and  a  network  analyser.  The  probe  penetrated 
into  equivalent  biological  tissue  contained  in  a  fibre-glass  phantom  head  (muscle  tissue). 

At  the  partially  shielded  EMSS  setup  temperature  measurements  were  performed  inside  equivalent  muscle  tissue 
to  calibrate  a  diode-based  E-field  probe.  This  field  probe,  with  data  acquisition  electronics  and  software,  was  then 
used  to  perform  electric  field  measurements  inside  a  female  phantom  filled  with  muscle  tissue.  From  these  electric 
field  measurements,  SAR  values  could  be  obtained. 

Computer  models  of  the  human  phantoms  were  created  using  CAT  scans  and  geometrical  coordinate  extraction 
software.  Models  of  the  mobile  phone  antennas  were  also  created.  Numerical  simulations  were  performed  using 
the  Method  of  Moments.  From  these  results  it  is  possible  to  extract  various  parameters  including  the  SAR  values 
inside  the  phantoms. 

SAR  results  were  compared  with  each  other  to  investigate  the  validity  of  the  techniques  employed.  It  was  not 
possible  to  investigate  all  the  different  scenarios  in  which  a  mobile  phone  can  be  used  by  an  operator.  The  three 
antennas  (dipole,  mobile  phone  with  extended  arm  and  with  short  arm)  were  each  positioned  in  only  one  typical 
operational  position  next  to  the  phantom  head. 

It  should  be  noted  that  the  hand  of  a  user  could  absorb  a  significant  percentage  of  the  power  transmitted  from 
a  mobile  phone.  The  hand  could  also  alter  the  effectiveness  of  the  phones  resulting  in  changes  in  the  SAR  levels 
as  well  as  the  SAR  distribution  in  the  head.  No  hand  was  included  in  any  of  the  measurements  or  simulations  and 
this  is  an  important  improvement  which  could  be  considered  for  future  work. 

There  is  some  room  for  improvement  of  the  measurement  equipment  as  well  as  the  computer  models  but  the 
agreement  between  simulations  and  measurements  are  in  general  very  good.  A  reliable  and  accurate  experimental 
and  numerical  dosimetry  capability  has  been  establish  that  could  serve  as  a  basis  from  which  to  address  more 
challenging  dosimetry  problems,  e.g.  low  SAR  antenna  design. 
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_  „  _  ,  Figure  8:  SAR  distribution:  female  phantom,  dipole  an- 

Figure  6:  Comparison  of  MoM  simulations  and  US  Setup  tenna. 
measurements  on  a  vertical  line  penetrating  into  the 


head  of  a  homogeneous  female  phantom. 
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Abstract- This  paper  examines  the  near  field  of  a  portable  radio  handset  at  850  MHz  operating  near 
a  model  of  the  human  head.  The  handset  and  head  are  solved  by  the  finite-difference  time-domain 
method.  For  the  handset  in  isolation,  the  near  field  computed  with  FDTD  is  compared  with  that 
computed  with  the  Numerical  Electromagnetics  Code,  with  excellent  agreement.  The  near  field  was 
measured  using  a  three-axis  probe  positioned  with  a  planar  scanner  system  in  an  anechoic  chamber. 
The  FDTD  computation  is  compared  with  the  measured  near  field  and  very  good  agreement  is 
shown.  Introducing  a  box  or  sphere  model  of  the  head  changes  the  field  of  the  handset.  The 
measured  near  field  of  the  handset  and  box  head,  and  of  the  handset  and  sphere  head  are  shown  to 
be  in  good  agreement  with  the  FDTD  computation. 


Introduction 

Fig.  1  shows  a  portable  radio  “handset”[l]  comprised  of 
an  aluminum  box  5.35  by  1.74  by  16.77  cm,  with  an  antenna 
8.82  cm  in  length,  located  0.87  cm  from  three  adjacent  edges  of 
the  case.  The  antenna  is  a  quarter-wave  monopole  operating  at 
850  MHz.  In  this  paper  we  investigate  the  vertical  component  of 
the  near  field  of  the  handset  in  an  xz  plane  located  at  y=-21.8  mm 
in  Fig.  1,  where  the  base  of  the  antenna  is  at  the  origin.  The 
objective  is  to  validate  computations  of  the  near  field  done  with 
the  finite-difference  time-domain(FDTD)  method[l,2]. 

Near  Field  of  the  Handset  Alone 

Using  a  cell  size  of  2.205  cm,  the  handset  case  is  very 
close  to  24  by  8  by  76  cells,  and  the  antenna  is  40  cell  edges  in 
length.  The  perfectly-matched  layer(PML)  absorbing 
boundary[2]  is  used,  6  cells  in  thickness  with  a  surface  reflection 
coefficient  of  0.001  and  parabolic  evolution  of  the  conductivity. 
The  handset  is  embedded  in  a  cell  space  96  by  88  by  188  cells 
with  the  base  of  the  antenna  on  cell  edge  41,  41,  113.  The 
surfaces  of  the  handset  and  the  tip  of  the  antenna  are  separated 
from  the  first  layer  of  the  PML  by  30  cells  of  free  space  or 
“whitespace”.  In  all  cases  in  this  paper  the  FDTD  computation  is 


Fig.  1  The  portable  radio 
handset  in  the  xyz 
coordinate  system. 
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run  for  4096  time  steps  with  a  sinusoidal  generator  at  850  MHz  at  the  base  of  the  monopole.  The 
FDTD  code  is  used  to  compute  a  set  of  “conical  cut”  radiation  pattems[l]  that  cover  the  surface  of 
the  radiation  sphere.  The  radiated  power  is  computed  from  the  patterns  as 


Prad 


and  the  fields  presented  in  this  paper  are  scaled  to  correspond  to  a  radiated  power  of  600  mW. 

An  initial  validation  of  the 
FDTD  computation  was  obtained  by 
comparing  the  near  field  of  the 
handset  computed  with  FDTD  with 
that  found  using  a  wire-grid  model  of 
the  handset[l]  solved  with  the 
Numerical  Electromagnetics 

Code(NEC)[3].  Fig.  2  shows  the 
vertical  component  E.  of  the  near 
field  of  the  handset  in  an  xz  plane  in 
Fig.  1  at  y=-21.8  mm.  The  field  is 
shown  in  decibels  with  1  volt  per 
meter  as  the  0  dB  reference.  The 
black  contours  were  computed  with 
FDTD.  The  region  of  space  around 
the  edges  of  the  map  corresponding  to 
the  PML  layers  were  removed  from 
the  FDTD  data.  In  Fig.  2,  the  FDTD- 
computed  contours  are  superimposed 
on  the  near  field  computed  with  the 
wire-grid  model  of  the  handset,  shown 
in  gray.  There  is  excellent  agreement 
between  the  two  computations.  The 
near  field  is  large  surrounding  the  tip 
of  the  antenna,  the  top  of  the  case  and 
the  base  of  the  case.  At  the  surface  of 
the  case  the  Ez  component  of  the 
field  would  be  zero.  At  our  xz  plane, 

13.1  mm  from  the  case  surface,  the 
near  field  falls  to  values  below  24  dB 
over  the  center  of  the  case.  The 
frequency-domain,  moment-method 
based  NEC  computation  obtains 
almost  the  identical  near  fields  to  the 
time-domain  FDTD  method. 


661 


Measurement  of  the  Near  Field 

The  near  field  of  the  handset  was  measured[4]  in  an  anechoic  chamber  2.5  m  in  height  and 
3.74  m  in  width  and  depth.  The  ceiling  and  walls  were  lined  with  12  inch  thick  absorber.  The 
probe  was  moved  over  the  measurement  plane  using  an  Orbit  Advanced  Technologies  planar 
scanner.  The  scanner  was  covered  with  12  inch  absorber  as  much  as  possible.  The  positioning 
mechanism  was  covered  with  three-inch  absorber. 

A  “DASY-2”  three-axis  probe[5]  uses  three  small,  orthogonal  dipoles  to  measure  the 
magnitude  of  each  of  the  three  components  of  the  near  field.  The  dipoles  are  terminated  with 
Schottky  diodes,  which  are  connected  with  high-resistance  lines  to  a  battery-operated  data 
acquisition  unit  at  the  base  of 
the  probe.  The  signals  are  then 
sent  via  a  fiber-optic  cable  to  a 
computer  plug-in  card.  We 
note  that  the  probe  does  not 
measure  Ex ,  Ey ,  and  E,  at 
exactly  the  same  point. 

Components  Ex  and  Ez  are 
measured  at  slightly  offset 
points  in  the  same  xz  plane; 
whereas  Ey  is  measured  in  an 
xz  plane  1.1  millimeters  further 
from  the  tip  of  the  probe. 

The  850  MHz  oscillator 
in  the  handset  is  powered  by  a 
rechargeable  battery,  with  a  life 
of  about  2  hours,  sufficient  to 
measure  a  plane  of  the  size 
presented  in  Figs.  3,  4  and  5. 

The  oscillator  output  declines 
over  the  course  of  a 
measurement.  The  change  in 
field  strength  over  time  was 
monitored  and  recorded,  and  the 
measured  data  presented  in  this 
paper  has  been  compensated  to 
account  for  declining  battery 
power. 

Fig.  3  compares  the 
FDTD-computed(black)  and 
measured(gray)  contours  for  the 
handset  alone  at  y=-21.8  mm. 

The  measured  fields  must  be 
scaled  to  correspond  in  level  to 
the  computation.  This  was 
done  by  computing  the  RMS 
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field  strength  along  a  “tie  line”, 


where  L  is  the  length  of  the  tie  line.  The  tie  line  was  chosen  as  the  left-hand  edge  of  the  computed 
contour  map  in  Fig.  3,  at  *=-7.277  cm,  from  r=-18.01  cm  to  £=11.95  cm.  If  Ec  is  the  RMS  field 
along  the  tie  line  in  the  computation,  and  Em  the  RMS  field  in  the  measurement,  then  the  measured 
data  is  scaled  by  Ec/Em  .  Tie-line  normalization  aligns  the  measured  and  computed  contours  to 
approximately  the  same  level  at  the  left  edge  of  the  contour  map.  If  the  maps  are  in  good 
agreement,  the  remaining  contours  will  then  align  well. 

In  Fig.  3  the  computed  contours(black)  align  very  well  with  the  measured  contours(gray) 
around  the  tip  of  the  antenna  and  the  top  of  the  handset  case.  At  the  base  of  the  case  the  42  dB 
contour  is  smaller  in  the  measurement  than  in  the  computation.  Over  the  surface  of  the  case  there  is 
a  large  computed  contour  at  24  dB;  whereas  the  measured  contour  over  the  case  is  27  dB.  The 
measured  field  is  about  3  dB  stronger  over  the  case.  The  contours  to  the  left  of  the  antenna  and 
handset  case  align  quite  well,  particularly  near  the  top  of  the  handset  case.  In  the  top  right-hand 
comer  of  the  contour  map,  we  note  that  the  measured  contours  tend  to  lie  somewhat  further  from 
the  antenna  than  do  the  computed  contours. 


The  Handset  and  the  Box  Head 

The  “box  head”  is  a  plexiglas  box  17.07  by  13.89  by  21.06  cm,  with  a  wall  thickness  of  0.51 
cm.  The  box  is  filled  with  “brain  liquid”,  a  sugar,  salt  and  water  mixture  having  the  electrical 
parameters  of  brain  matter[6].  The  relative  permittivity  and  conductivity  of  the  brain  liquid  were 
measured  to  be  40.42  and  1.064  S/m,  respectively,  and  these  values  were  used  in  the  computations. 
With  the  base  of  the  antenna  in  Fig.  1  at  the  origin,  the  box  head  is  located  in  the  +y  halfspace  with 
one  comer  at  *=-6.73,  y=1.81  and  £=-16.77  cm.  The  bottom  of  the  handset  is  at  the  same  height  as 
the  bottom  of  the  box  head.  Using  2.205  mm  cells,  the  box  is  modeled  in  FDTD  with  78  by  63  by 
95  cells,  and  the  plexiglas  wall  is  two  cells  thick.  The  2.205  mm  cell  size  corresponds  to  one-tenth 
wavelength  in  the  brain  liquid  at  2125  MHz.  To  find  the  coefficients  for  updating  the  electric  field 
on  each  cell  edge,  the  FDTD  code  averages  the  permittivity  and  conductivity  of  the  four  adjacent 
cells.  The  FDTD  cell  space  was  terminated  with  a  PML  as  described  above.  A  bigger  cell  space 
was  used  for  the  box  head  problem,  149  by  147  by  188  cells,  which  puts  the  base  of  the  antenna  on 
cell  edge  67,  41,  113.  The  FDTD  model  was  run  for  4096  time  steps,  more  than  adequate  to  reach 
the  sinusoidal  steady  state. 

Fig.  4  shows  the  computed(black)  field  contours  for  the  vertical  component  of  the  near  field 
at  y=-21.8  mm.  This  xz  plane  is  on  the  opposite  side  of  the  handset  from  the  box  head.  Comparing 
Figs.  3  and  4  shows  a  42  dB  contour  surrounding  the  tip  of  the  antenna  for  both  the  handset  alone 
and  the  handset  and  box  head.  The  tip  is  well  above  the  top  of  the  box  head.  However,  the  contour 
for  the  handset  alone  near  the  top  of  the  handset  is  at  45  dB,  but  only  42  dB  for  the  handset  and  box 
head.  Similarly,  the  contour  at  the  bottom  of  the  handset  is  42  dB  for  the  handset  alone  but  only  39 
dB  for  the  handset  and  box  head.  The  computed  field  over  the  surface  of  the  handset  falls  to  values 
less  than  18  dB  in  Fig.  4,  much  less  than  the  field  of  24  dB  for  the  handset  alone  in  Fig.  3. 

Fig.  4  compares  the  computed(black)  and  measured(gray)  contours  for  the  vertical 
component  of  the  field.  The  measurement  was  scaled  to  match  the  level  of  the  computation  along  a 
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Distance  in  x  term 

Fig.  4  Comparison  of  the  computed(black)  and  measured(gray)  Ez  for  the  handset  and  box 
head,  at  21.8  mm  from  the  antenna. 

tie  line  along  the  left  edge  of  the  measured  contour  map,  at  *=-7.494  cm,  from  z=-18.01  to  z=11.95 
cm.  The  measurement  and  computation  agree  very  well  near  the  tip  of  the  antenna.  At  the  top  of 
the  handset  case,  the  computed  42  dB  contour  is  somewhat  smaller  in  the  computation  than  in  the 
measurement.  Conversely  at  the  base  of  the  handset  the  computed  39  dB  contour  is  larger  than  the 
measured  39  dB  contour.  Over  the  surfaces  of  the  handset  case,  the  measured  field  falls  to  values 
below  21  dB,  considerably  smaller  than  the  smallest  measured  contour  of  27  dB  for  the  handset 
alone  We  note  that  the  computed  field  falls  to  below  18  dB,  about  3  dB  less  than  the  measured 


field.  Also  the  location  of  the  smallest  fields  in  the  measurement  is  more  towards  the  bottom  of  the 
case  than  in  the  computation. 

Handset  and  Sphere  Head 

The  “sphere  head”  is  20.68  cm  in  diameter  with  a  wall  thickness  of  3.65  mm.  The  box  head 
and  the  sphere  head  have  nearly  the  same  volume.  With  the  base  of  the  antenna  at  the  origin,  the 
center  of  the  sphere  is  at  *=1.81,  y=12.15  and  z=-6.43  cm.  This  puts  the  lowest  point  on  the  sphere 
at  the  same  height  as  the  bottom  of  the  handset  case.  The  sphere’s  curved  surface  was 


approximated  with  a  “staircase”  of  2.205  mm  cells.  The  3.65  mm  plexiglas  wall  thickness  is  about 
1.65  cells  thick;  it  was  approximated  as  either  one  cell  or  two  cells  thick,  being  the  difference 
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between  the  staircase  modeling  the  outer  sphere  and  the  inner  sphere  defining  the  plexiglas  wall. 
The  sphere  is  94  cells  or  20.73  cm  in  diameter,  a  reasonable  approximation  of  the  true  20.68  cm 
diameter.  The  FDTD  code  uses  coefficients  for  updating  the  electric  field  on  each  cell  edge  which 
average  the  permittivity  and  conductivity  of  the  four  adjacent  cells.  This  tends  to  smooth  the 
staircased  surface  of  the  sphere.  The  FDTD  cell  space  was  terminated  with  a  PML  as  described 
above.  The  sphere  head  required  a  larger  cell  space  than  the  box  head,  166  by  178  by  188  cells,  with 
the  base  of  the  antenna  on  cell  edge  76, 41,  113. 

Fig.  5  shows  the  vertical  component  of  the  near  field  of  the  sphere  in  an  xz  plane  at  y=-21.8 
mm.  Comparing  the  near  field  of  the  handset  and  box  head,  Fig.4,  and  the  handset  and  sphere  head. 
Fig.  5,  we  see  that  the  field  around  the  tip  of  the  antenna  is  about  the  same  at  42  dB.  At  the  top  of 
the  handset  case  the  field  of  the  handset  and  sphere  head  shows  a  large  45  dB  contour,  considerably 
stronger  than  that  of  the  handset  and  box  head.  Again  at  the  base  of  the  handset  case,  the  handset 
and  sphere  head  field  exceeds  45  dB,  whereas  the  handset  and  box  head  field  has  a  small  39  dB 
contour.  Indeed,  the  fields  of  the  handset  and  sphere  head  are  much  more  like  the  fields  of  the 
handset  alone  in  Fig.  3.  The  square  shape  of  the  box  head  keeps  the  box  close  to  the  handset  case  at 
the  top  and  bottom  of  the  handset  case.  The  round  surface  of  the  sphere  is  close  to  the  handset  near 
the  center  of  the  case,  but  curves  away  from  the  handset  at  the  top  of  the  case  and  particularly  at  the 
bottom  of  the  case. 

Comparing  the  handset  and  sphere  head  fields  of  Fig.  5  with  the  handset  fields  of  Fig.  3,  the 
45  dB  contour  around  the  top  of  the  case  is  larger  with  the  sphere  head  than  for  the  handset  alone. 
The  handset  and  sphere  head  shows  fields  higher  than  45  dB  near  the  bottom  of  the  case,  whereas 
for  the  handset  alone  there  is  no  45  dB  contour. 

Fig.  5  compares  the  FDTD-computed  field(black  contours)  with  the  measured  field(gray 
contours).  The  measured  data  was  scaled  to  the  level  of  the  computed  data  along  the  left  edge  of 
the  contour  map,  as  for  the  box  head.  The  computation  and  measurement  agree  very  well.  The  42 
and  39  dB  contours  at  the  tip  of  the  antenna  almost  coincide.  The  45  dB  contour  at  the  top  of  the 
handset  case  is  almost  the  same.  At  the  bottom  of  the  case  the  39  and  42  dB  contours  correspond 
well.  However,  the  computation  shows  a  small  region  with  fields  in  excess  of  45  dB,  not  seen  in 
the  measurement.  Over  the  surface  of  the  handset  case  the  computed  fields  fall  below  24  dB,  but 
the  measured  fields  never  fall  as  low  as  this. 


Conclusion 

This  paper  has  presented  a  validation  of  FDTD  computations  of  the  vertical  component  of 
the  near  field  of  a  portable  radio  handset  in  a  plane  close  to  the  case  of  the  handset.  The  field 
computed  with  FDTD  was  compared  with  a  moment-method  computation  of  the  near  field  with 
excellent  agreement.  The  near  field  of  the  handset  was  measured  and  shown  to  be  in  very  good 
agreement  with  the  computed  near  field.  The  field  of  the  handset  and  box  and  sphere  models  of  the 
head  were  measured,  and  very  good  agreement  with  the  FDTD  computation  was  shown.  The  fields 
are  strong  near  the  tip  of  the  antenna,  and  near  the  top  and  bottom  of  the  handset  case.  The 
computed  fields  in  these  regions  agree  very  well  with  the  measured  fields  particularly  for  the 
handset  alone  and  for  the  handset  and  sphere  head. 

The  agreement  is  slightly  less  good  for  the  handset  and  box  head.  We  note  that  the  box  head 
cell  model  uses  4  cells  between  the  handset  surface  and  the  box  surface,  or  8.82  mm.  However,  the 
actual  separation  is  9.4  mm,  almost  half  a  cell  further  away.  This  modeling  error  could  be  reduced 
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by  more  sophisticated  material  averaging  in  the  FDTD  code,  or  by  discarding  the  Yee  FDTD 
algorithm  in  favor  of  scheme  able  to  resolve  the  geometry  more  precisely. 

This  paper  compared  the  fields  of  the  handset,  handset  and  box  head,  and  handset  and  sphere 
head  on  the  basis  of  equal  radiated  power.  The  box  head  near  field  in  Fig.  4  is  more  different  from 
the  field  of  the  handset  in  isolation,  Fig.  3,  than  is  the  field  of  the  handset  and  sphere  head  in  Fig.  5. 
The  box  head  reduces  the  field  strength  near  the  top  and  the  bottom  of  the  handset  by  3  dB,  but 
changes  the  field  near  the  tip  of  the  antenna  very  little.  The  sphere  head  curves  away  from  the 
handset,  and  so  the  handset  and  sphere  head  fields  are  comparable  to  those  of  the  handset  alone  near 
the  top  and  bottom  of  the  box. 

A  real  head  is  neither  a  sphere  nor  a  box.  It  is  more  like  a  sphere  at  the  top  of  the  head,  and 
perhaps  more  box-like  at  the  jaw  and  neck.  Nor  is  the  handset  held  vertically  in  normal  operation. 
If  held  in  a  natural  position,  our  rather  large  handset  is  close  to  the  head  at  the  top  of  the  handset, 
above  and  behind  the  ear.  There  the  fields  are  likely  to  be  like  those  of  the  handset  and  box  head. 
But  the  bottom  of  the  handset  juts  out  beyond  the  jaw,  and  so  the  fields  are  likely  to  be  close  to 
those  of  the  handset  and  sphere.  Measurements  are  planned  for  the  handset  and  a  realistic  five- 
tissue  phantom  of  the  head,  with  the  handset  positioned  realistically  relative  to  the  head. 
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FDTD  for  Bioelectromagnetics: 

Modeling  MICS  Implants  in  the  Human  Body 
Eduardo  Villaseca 
Medtronic  Inc. 

Minneapolis,  MN  55432 

Medtronic  Inc.  is  developing  a  communication  system  for  implanted  medical  devices.  For  MICS 
(Medical  Implant  Communications  System)  to  be  successful,  it  is  important  to  be  able  to  predict  the 
electromagnetic  performance  of  such  an  implant. 

Preliminary  models  used  a  mathematical  technique  called  the  Method  of  Moments  to  model  the 
performance  of  an  implanted  device,  but  the  Method  of  Moments  could  not  predict  effects  around  the 

entire  body. 

These  initial  models  were  very  important  in  establishing  the  practicality  of  MICS,  but  they  need 
refinement  and  expansion  before  they  can  provide  the  type  and  quality  of  information  needed  as  MICS 
development  proceeds.  In  particular,  we  need  to  be  able  to  see  the  variation  of  gain  with  location 
around  the  body,  to  ensure  that  MICS  will  operate  reliably,  and  to  evaluate  the  absorption  rate  of 
power  into  the  body  while  MICS  is  operating,  to  ensure  that  MICS  will  operate  safely. 


FDTD  Modeling  .  .  .«.  . 

For  this  more  advanced,  improved  modeling,  we  have  adopted  the  Finite  Difference  Time  Domain,  or 
FDTD,  modeling.  FDTD  was  originally  developed  by  [Yee’jand  has  been  described  extensively  m  the 
literature.  The  method  is  a  direct  solution  of  the  differential  form  of  Faraday’s  and  Ampere’s  laws. 
These  differential  equations  are  converted  into  difference  equations  using  the  central  difference 
approximations.  The  field  components  are  interleaved  on  each  unit  cell,  so  that  the  E  and  H 
components  are  half  a  cell  apart,  which  is  referred  to  as  “leap-frog”  scheme.  In  addition  to  being  leap¬ 
frogged  in  space,  they  are  also  leap-frogged  in  time.  The  E  field  is  assumed  to  be  at  time  nAt,  and  the 
H  field  is  assumed  to  be  at  time  (n+l/2)At.  • 


FDTD  has  become  the  preferred  method  for  bioelectromagnetic  calculations  at  radio  frequencies.  It  is 
efficient  for  modeling  large-scale  heterogeneous  penetrable  objects  —  like  the  human  body.  FDTD 
allows  the  use  of  many  dielectric  constants,  allowing  definition  of  different  types  of  tissue  and  organs 
within  the  body.  The  Method  of  Moments,  in  contrast,  is  computationally  extremely  expensive  to  use 
with  more  than  two  dielectric  constants,  severely  limiting  our  ability  to  define  different  tissue  types. 
The  Method  of  Moments  also  makes  unreasonably  large  demands  on  computer  memory  and  time 
when  modeling  internal  fields  in  electrically  large  lossy  dielectric  bodies,  such  as  the  human  body, 
which  FDTD  can  handle  more  economically  and  simply.  However,  the  Method  of  Moments  works 
excellently  with  wires  and  metal  patches,  which  made  it  the  preferred  choice  for  the  preliminary  MICS 
analysis. 


1  Yee,  K.S.,  Numerical  solution  of  initial  boundary  value  problems  involving  Maxwell’s  equations  in  isotropic  media, 
IEEE  Trans.  Ant.  Prop.,  14(3),  302,  1966 
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FDTD  has  numerous  other  advantages  for  more  advanced  modeling.  It  is  very  accurate  and  fairly  easy 
to  use.  It  operates  with  a  regular,  orthogonal  grid,  with  the  wave  frequencies  of  the  modeling 
dependent  on  the  dimensions  of  the  grid.  In  the  case  of  MICS  modeling,  a  5  mm  grid  of  six  million 
cells  allowed  creation  of  a  model  effective  up  to  1  gigahertz.  Newly  available  high  performance 
absorbing  boundary  conditions  allow  good  computations  throughout  the  body  without  the  distortions 
that  occurred  in  earlier  models  as  a  result  of  reflections  from  the  outer  radiation  boundaries  of  the 
model. 


The  Visible  Human  Model 

MICS  modeling  started  with  a  human  body  model  obtained  from  REMCOM,  Inc.  This  model  was 
created  from  “frozen  man”  slices  from  the  Visible  Human  Project,  obtained  via  the  Internet  and 
meshed  using  custom  software. 

The  bioelectromagnetic  human  body  model  is  a  digitized  version  of  a  real  human  body.  A  donated 
cadaver  was  frozen  and  sliced  in  5  mm  slices.  Each  slice  was  identified  for  parameters  such  as  muscle, 
fat,  and  bone.  The  results  were  digitized  to  create  a  total  numerical  model,  using  a  5  mm  cell,  for  a 
total  of  about  six  million  cells.  Twelve  different  types  of  tissue  were  identified.  Constitutive 
parameters  were  assigned  based  on  the  original  data,  and  checked  and  corrected  using  interactive 
editing. 

Figure  1  shows  a  demonstration  cross  section  of  the  human  body  model. 


Figure  1.  Sample  bioelectromagnetic  human  body  cross-section  model. 
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MICS  Implant 


The  MICS  implant  placed  in  the  chest  is  shown  in  Figure  2. 


Figure  2.  MICS  implant  with  coaxial  antenna. 


Figure  3  shows  the  electromagnetic  field  distribution  in  the  chest  area  of  the  body  with  the  MICS 
implant  in  place  in  the  upper  right  quadrant,  modeled  at  UHF. 


Figure  3.  The  MICS  is  visible  in  the  upper  left  quadrant  of  the  chest  in  this  section. 
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Figure  4  shows  the  coverage  around  the  implant,  in  terms  of  the  azimuth.  An  azimuth  of  90°  is  directly 
in  front  of  the  body,  0°  and  180°  are  to  the  sides,  and  270°  is  directly  behind  the  body. 


Implant 

0° 


Figure  4.  Azimuth  coverage  of  measurements  around  the  MICS  implant. 

Variation  in  Gain  Around  the  Human  Body  Model 

Figure  5  shows  the  results  of  measuring  the  implant  antenna  gain  around  the  body  according  to  the 
azimuth  coverage  in  Figure  4.  The  variation  of  gain  with  azimuth  is  clearly  visible.  The  gain  varies 
with  the  length  of  the  travel  path  around  the  body  to  the  receiver;  overall,  the  combination  of 
horizontal  and  vertical  polarization  yields  a  gain  that  can  vary  from  about  -25  to  about  -40  dBi. 
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Figure  5.  Implant  antenna  gain  as  a  function  of  azimuthal  angle  around  the  body  section  shown  in 

Figure  4. 
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Specific  Absorption  Rate  Modeling 

In  addition  to  evaluating  MICS  performance  by  modeling  the  variations  in  antenna  gain  around  the 
body,  the  human  body  model  can  be  used  to  evaluate  the  safety  of  the  MICS  implant  by  examining  the 
Specific  Absorption  Rate  (S  AR)  in  the  body.  In  general,  the  FDTD  method  calculates  the  time-domain 
vector  E  and  H  fields  at  every  location  inside  and  outside  of  the  body.  These  can  be  converted  to 
frequency  domain  fields  (magnitude  and  phase  at  given  frequencies).  From  them,  values  commonly  of 
interest  in  bioelectromagnetic  simulations  can  be  calculated,  including  specific  absorption  rate,  current 
density,  total  power  absorbed,  temperature  rise,  etc. 

SAR  is  an  important  measure  of  the  amount  of  power  the  implant  is  putting  into  the  human  body  per 
kilogram  of  tissue.  For  near-field  applications,  such  as  a  medical  implant  with  telemetry  transmitter,  it 
is  important  to  determine  if  the  device  complies  with  the  ANSI/IEEE  safety  guidelines[ANSI]  and 
newly-mandated  FCC  guidelines.  These  guidelines  state  that  an  exposure  can  be  considered  to  be 
acceptable  if  it  can  be  shown  that  it  produces  SAR’s  “below  0.08W/kg,  as  average  over  the  whole 
body,  and  spatial  peak  SAR  values  not  exceeding  1.6  W/kg,  as  averaged  over  1  g  of  tissue  (defined  as 
a  tissue  volume  in  the  shape  of  a  cube)”[ANSI]. 

SAR  for  the  MICS  implant  was  evaluated  for  the  Medical  Implant  Communication  System  operating 
at  0  dBm,  or  power  input  of  1  milliwatt  into  the  body.  The  SAR  at  a  given  location  is  given  by  the 
following  formula 


where  a  is  the  electrical  conductivity  and  p  is  the  mass  density  at  the  location  of  interest. 

Three  different  values  must  be  evaluated  to  ensure  that  the  implant  meets  safety  standards: 

•  the  maximum  SAR  considering  total  power  applied  to  total  weight 

•  the  maximum  SAR  considering  the  same  amount  of  power  applied  to  1  gram  of  tissue 

•  the  maximum  SAR  considering  the  same  amount  of  power  applied  to  1 0  grams  of  tissue. 

Figure  6  summarized  these  values  and  showed  the  location  of  the  maximum  in  each  case. 
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Figure  6.Summary  of  SAR  Information 

Figures  7  and  8  illustrate  the  SAR  distribution  and  maximum  for  the  1  g  average  and  10  g  average 
cases  respectively. 


Figure  7.  1  g  Average  SAR  (W/kg).  The  maximum  SAR  occurs  in  the  location  of  the  connector 

terminal  to  the  implant. 
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Figure  8.  10  g  Average  SAR  (W/kg).  As  in  Figure  7,  the  maximum  SAR  is  the  location  of  the 
connector  terminal  to  the  implant 


Implant  Safety  Conclusions 

Figures  6-8  illustrate  the  important  SAR  conclusions: 

•  The  SAR  average  over  the  whole  body  is  1 .8e-05  W/kg.  This  is  36  dB  below  the  ANSI  safety 
standard  of  0.08  W/kg. 

•  The  maximum  10  g  average  SAR  is  1.9e-02  W/kg.  This  is  23  dB  below  the  ANSI  safety  standard 
of  4  W/kg. 

•  The  maximum  1  g  average  SAR  is  6.7e-02  W/kg.  This  is  14  dB  below  the  ANSI  safety  standard  of 
1.6  W/kg. 

Summary  of  FDTD  Modeling  Conclusions 

FDTD  modeling  has  been  used  for  the  first  time  with  the  bioelectromagnetic  human  body  model  to 
obtain  implant  performance.. 

The  results  of  modeling  implant  antenna  gain  show  a  variation  of  gain  with  location  around  the  body 
that  must  be  accounted  for  to  ensure  that  MICS  will  operate  reliably. 

Analysis  of  SAR  over  the  total  body,  over  1  g  of  tissue,  and  over  10  g  of  tissue  reveal  an  absorption 
rate  of  power  into  the  body  well  within  the  standard  guidelines  for  safety,  demonstrating  the  ability  of 
MICS  to  operate  safely  within  the  human  body. 
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Abstract 

We  are  investigating  a  new  ultrawideband  microwave  radar  technology  to  detect  and  image  early-stage 
malignant  breast  tumors  that  are  often  invisible  to  X-rays.  In  this  paper,  we  present  the  methodol¬ 
ogy  and  results  of  two-  and  three-dimensional  finite-difference  time-domain  simulations.  The  discussion 
concentrates  on  the  investigation  of  signal-to-clutter  ratios  and  dynamic  range  requirements  of  the  mi¬ 
crowave  system.  Our  FDTD  studies  have  demonstrated  that  the  sensitivity  and  dynamic  range  of  an 
optimized  antenna  array  in  conjunction  with  existing  microwave  equipment  should  be  adequate  to  detect 
noncalcified  tumors  less  than  5  mm  in  diameter  located  at  depths  of  up  to  about  5  cm  in  the  breast. 

I.  Introduction 

X-ray  mammography  is  currently  the  most  effective  screening  modality  for  detecting  nonpalpable  breast 
tumors.  However,  X-ray  mammography  has  limited  sensitivity  and  specificity  in  1)  assessing  dense  glan¬ 
dular  tissue  and  tissue  close  to  the  chest  wall  or  underarm  and  2)  imaging  very  early-stage  tumors  that  do 
not  yet  exhibit  microcalcifications.  According  to  the  National  Cancer  Institute,  screening  mammograms 
miss  up  to  25%  of  all  breast  cancers.  A  related  concern  is  the  high  rate  of  false  positives  that  result 
in  unnecessary  biopsies.  Since  these  problems  remain  unresolved,  complementary  modalities  based  on 
imaging  other  physical  tissue  properties  are  needed  to  further  improve  detection  capabilities. 

We  have  recently  proposed  a  novel  pulsed  microwave  confocal  system  for  the  detection  of  early-stage 
breast  cancer  [1,  2,  3,  4].  Our  system  has  the  potential  to  detect  very  small  non-calcified  cancers,  including 
those  in  radiographically  dense  breasts  and  in  regions  near  the  chest  wall  or  underarm.  Furthermore, 
this  noninvasive  approach  avoids  exposure  to  ionizing  radiation  and  does  not  require  breast  compression. 
The  microwave  exposure  is  well  within  the  safety  limits  set  by  ANSI/IEEE  [5].  The  safety,  comfort, 
ease-of-use,  and  low-cost  features  should  permit  frequent  screening.  Augmenting  X-ray  mammography 
in  this  manner  could  help  to  reduce  the  number  of  false  negatives  and  false  positives. 

The  physical  basis  of  this  technology  is  the  differential  microwave  backscatter  response  from  tissues 
based  on  their  water  content,  a  tissue-radiation  interaction  mechanism  that  is  distinct  from  density-based 
attenuation  of  X-rays.  As  shown  in  Table  1,  the  differing  water  content  of  normal  and  malignant  breast 
tissues  results  in  an  order-of-magnitude  dielectric-property  contrast  at  microwave  frequencies  [6,  7,  8]. 
Thus  a  malignant  tumor  has  a  significantly  larger  radar  cross  section  than  that  of  normal,  fatty  breast 
tissue.  Normal  breast  tissue  dielectric  heterogeneity  is  observed  on  a  distance  scale  of  about  0.5  cm  and 
is  bounded  in  a  ±10%  range  about  the  nominal  values  given  in  Table  1. 
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tissue  type 

er 

a  (S/m) 

normal  fatty  breast 

9.0 

0.4 

tumor 

50.0 

7.0 

skin  [6] 

36.0 

4.0 

vein  [6] 

50.0 

7.0 

mammary  glands  (15%  higher  than  fat) 

10.45 

0.46 

mammary  glands  (50%  higher  than  fat) 

13.5 

0.6 

Table  1:  Dielectric  parameters  of  breast  tissue  at  6  GHz. 

Our  pulsed  microwave  confocal  system  is  based  on  ultrawideband  radar  technology  and  confocal  op¬ 
tical  microscopy.  The  microwave  sensor  is  comprised  of  an  electronically  scanned  antenna  array.  Each 
ultrawideband  antenna  element  located  at  a  particular  position  on  the  surface  of  the  breast  is  excited  and 
the  backscattered  waveform  is  collected.  This  is  repeated  in  sequence  for  the  other  elements  in  the  array. 
The  low  attenuation  in  normal  breast  tissue  (less  than  4  dB/cm  up  to  10  GHz)  permits  constructive  addi¬ 
tion  of  the  wideband  backscattered  returns.  Therefore,  as  a  post-processing  step,  the  set  of  backscattered 
waveforms  are  variably  time-shifted  to  achieve  coherent  addition  for  a  desired  virtual  focal  point  within 
the  breast.  Backscatter  from  off-focus  scatterers  in  the  heterogeneous  breast  add  incoherently  and  are 
thereby  suppressed.  We  note  that  our  approach  has  no  relation  to  matrix  schemes  using  microwave  or 
impedance  measurements,  microwave  backscatter  methods  based  on  broad  illumination  of  the  breast,  or 
techniques  involving  passive  thermography  or  active  tomography.  Three  wide-ranging  U.S.  patents  for 
our  technology  were  awarded  recently,  and  several  additional  patents  are  pending. 

Two  key  performance  specifications  for  the  microwave  sensor  are  the  signal-to-clutter  (S/C)  ratio, 
defined  as  the  ratio  of  the  peak  backscatter  return  from  a  tumor  to  the  peak  backscatter  return  from 
clutter,  and  the  system  dynamic  range,  defined  as  the  ratio  of  the  peak  pulse  power  of  the  source  to 
the  system  noise  floor  due  to  reverberations  and  thermal  noise.  We  have  developed  two-  and  three- 
dimensional  finite-difference  time-domain  (FDTD)  [9,  10]  models  of  the  microwave  sensor  to  analyze  S/C 
ratios  and  the  dynamic  range  requirements,  and  to  assess  the  overall  feasibility  of  the  proposed  system 
for  detecting  early-stage  breast  cancer. 

II.  FDTD  Simulations  of  the  Coherent- Addition  Antenna  Array:  Signal-to-Clutter 
Ratios 

Fig.  la  illustrates  the  FDTD  model  of  a  microwave  sensor  comprised  of  a  17-element  monopole  array 
spanning  8  cm.  The  monopoles  are  spaced  at  0.5-cm  intervals  along  the  surface  of  a  1-mm-thick  skin 
layer.  A  0.5-cm-diameter  circular  tumor  is  embedded  in  a  slab  of  normal  fatty  breast  tissue  at  a  depth 
of  3.0  cm  directly  below  the  center  of  the  monopole  array.  To  simulate  the  heterogeneity  of  the  normal 
breast  tissue  [7,  8],  ±10%  random  fluctuations  of  er  and  o  were  assigned  to  the  breast  tissue  half-space 
in  a  checkerboard  pattern  of  5-mm  square  blocks. 

The  modeling  procedure  involves  simulating  an  impulsive  excitation  of  each  monopole  element,  one 
at  a  time,  then  recording  the  backscattered  pulse  response  observed  at  the  excited  element.  The  pulse 
excited  by  the  transmitting  antenna  is  a  differentiated  Gaussian  with  the  following  time  dependence: 

V{t)  =V0{t-  t0)e-^-to)/r]2  volts  (1) 

where  r  =  0.04  ns,  and  i0  =  4r.  This  pulse  has  a  temporal  width  of  110  ps  (FWHM),  an  amplitude 
spectral  width  of  9.0  GHz  (FWHM),  and  zero  dc  content.  We  chose  a  different  pulse  shape  here  from  that 
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reported  in  our  previous  publications  to  verify  that  our  proposed  detection  system  is  robust  relative  to  the 
precise  shape  of  the  exciting  pulse.  As  a  post-processing  operation  on  the  FDTD-calculated  backscatter 
data,  the  17  individual  waveforms  are  time-shifted  to  achieve  a  coherent  sum  of  the  backscattered  response 
for  the  desired  focal  point.  Varying  the  distribution  of  these  time  shifts  across  the  array  creates  the 
synthetic  focal  point.  For  each  investigation,  simulations  are  performed  with  and  without  the  tumor 
present.  The  latter  case  establishes  the  background  clutter.  Fig.  2a  depicts  the  received  power  waveform 
upon  coherently  summing  the  17  backscatter  responses.  The  solid  curve  represents  the  composite  received 
signal  from  the  0.5-cm-diameter.  The  dotted  curve  represents  the  composite  clutter  signal  received 
without  the  tumor  present.  The  tumor  response  is  clearly  visible. 


b) 

Figure  1:  a)  2-D  FDTD  computational  model  of  the  17-element  sensor  placed  at  the  surface  of  the  breast. 
Here,  the  breast  model  includes  skin,  heterogeneous  normal  breast  tissue,  and  a  0.5-cm  diameter  tumor 
located  at  a  depth  of  3.0  cm.  b)  Computational  model  with  mammary  ducts  and  lobules  added. 

To  accurately  quantify  the  S/C  ratio,  it  is  convenient  to  subtract  the  exponential  decay  from  the 
two  curves  in  Fig.  2a.1  Fig.  2b  shows  the  results  after  performing  the  subtraction  and  normalizing  the 
curves.  Upon  forming  the  ratio  of  the  peak  backscattered  pulse  amplitude  with  the  tumor  present  to  the 
peak  amplitude  without  the  tumor  present,  the  S/C  ratio  is  found  to  be  greater  than  30  dB.  (In  our  prior 
publications,  the  S/C  ratios  were  estimated  without  subtracting  the  exponential  decay  and  as  a  result 
represent  an  underestimate  of  the  actual  ratios.)  Fig.  3  graphs  the  calculated  S/C  ratio  as  a  function  of 
tumor  diameter  for  a  tumor  fixed  in  position  at  the  in-breast  synthetic  focus.  These  results  show  that 
tumors  having  diameters  as  small  as  1  mm  yield  responses  that  are  well  above  the  background  clutter, 
illustrating  the  potential  for  high  sensitivity  with  our  proposed  detection  method.  As  reported  in  [4],  we 
also  tested  the  performance  of  the  system  under  the  condition  wherein  the  heterogeneous  normal  breast 
tissue  is  assumed  to  have  a  Debye  dispersion.  The  frequency-dependent  parameters  were  incorporated 
into  the  FDTD  algorithm  using  the  auxiliary  differential  equation  method  [11].  We  observe  essentially 
no  degradation  of  the  S/C  ratio  relative  to  the  corresponding  nondispersive  modeling  results. 

Fig.  lb  illustrates  the  geometry  that  results  from  surrounding  the  0.5-cm-diameter  tumor  with  a 
cluster  of  mammary  lobes  and  ducts.  Here,  the  simulated  gland  cluster  is  modeled  in  the  FDTD  grid 
by  positioning  a  group  of  ellipses  having  higher  eT  and  a  than  normal  breast  tissue.  Since  we  have  not 
found  measured  data  on  the  dielectric  properties  of  mammary  glands,  we  have  performed  sensitivity 
tests  of  our  assumed  values  of  er  and  a  for  these  structures.  For  example,  Figs.  4a  and  4b  graph  the 
post-processed  backscattered  signals  for  mammary  glands  having  15%  and  50%  higher  er  and  a  than 
normal  breast  tissue,  respectively.  In  comparing  Figs.  4a  and  4b  with  Fig.  2a,  it  is  clear  that  the  tumor 

lA  third  simulation  is  performed  for  the  geometry  of  Fig.  la  without  the  tumor  and  without  the  normal  breast  tissue 
heterogeneity.  The  composite  received  signal  for  this  homogeneous  case  consists  solely  of  the  exponential  decay  in  the  time 
window  from  0.5  ns  to  1.4  ns.  It  is  this  signal  that  is  used  to  subtract  off  the  exponential-decay  baseline  of  Fig.  2a. 
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response  is  only  slightly  degraded.  We  note  that  these  results  axe  obtained  using  the  same  distribution 
of  time  delays  across  the  antenna  array  used  in  the  previous  model  of  Fig.  la.  Optimizing  the  time 
delays  should  mitigate  the  minor  level  of  degradation  observed.  Apparently,  the  modeled  cluster  of  lobes 
and  ducts  generates  a  backscattered  return  that  propagates  incoherently  to  the  multiple  observation 
locations.  Similar  to  the  modeled  random  heterogeneity  of  the  fatty  breast  tissue,  this  clutter  signal  is 
suppressed  upon  coherent  addition  in  favor  of  the  tumor’s  signature. 


a)  b) 

Figure  2:  a)  FDTD-computed  time-domain  waveforms  resulting  from  time-shifting  and  summing 
backscattered  responses  for  the  system  shown  in  Fig.  la.  b)  Waveforms  after  subtracting  off  the  expo¬ 
nential  decay  from  both  curves  of  Fig.  2a. 


Figure  3:  S/C  ratio  for  the  backscattered  response  of  the  tumor  located  at  the  in-breast  synthetic  focus, 
shown  in  Fig.  la,  as  a  function  of  the  tumor  diameter. 
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a)  b) 

Figure  4:  FDTD-computed  time-domain  waveforms  for  the  tissue  geometry  of  Fig.  lb.  The  mammary 
ducts  and  lobules  are  assigned  values  of  er  and  a  that  are  (a)  15%  or  (b)  50%  higher  than  the  nominal 
values  used  for  the  heterogeneous  surrounding  breast  tissue. 


III.  FDTD  Simulations  of  an  Antenna- Array  Element:  Dynamic  Range  Requirements 

We  have  conducted  3-D  FDTD  simulations  to  design  a  wideband  bowtie  antenna  for  use  as  an  element  in 
the  pulsed  microwave  confocal  array  [12,  13].  The  bowtie  antenna  has  a  flare  length  of  4  cm  and  a  flare 
angle  of  53°.  As  reported  in  [12,  13],  the  antenna  is  resistively  loaded  in  order  to  suppress  the  reflections 
from  the  ends  of  the  bowtie.  The  antenna  is  embedded  within  a  large  block  of  lossy  dielectric  material 
that  matches  the  dielectric  parameters  of  normal  breast  tissue.  In  the  FDTD  models,  the  bowtie  antenna 
element  is  located  at  the  surface  of  the  breast.  The  breast  model  is  comprised  of  a  1-mm-thick  layer  of 
skin  and  a  homogeneous  half-space  of  normal  breast  tissue.  The  slanted  edges  of  the  bowtie  antenna 
are  approximated  using  staircasing  with  a  sub-mm  spatial  grid  resolution.  The  zero-dc  ultrawideband 
excitation  is  implemented  as  a  1-V,  50- SI  resistive  voltage  source  at  the  antenna  feed  point  [14].  The 
FDTD  grid  is  terminated  with  a  perfectly  matched  layer  absorbing  boundary  condition  [15].  The  resistive 
loading  together  with  the  lossy  nature  of  the  biological  tissue  and  the  bandpass  nature  of  the  excitation 
drops  the  antenna  reverberation  to  -125  dB  relative  to  the  exciting  pulse. 

Using  the  resistively  loaded  bowtie  antenna,  we  have  performed  benchmark  simulations  to  estimate 
the  dynamic  range  requirements  of  the  microwave  system  for  detecting  a  compact  breast  tumor  located 
directly  below  the  antenna  feed  point.  The  dynamic  range  should  be  large  enough  to  permit  detection 
of  a  tumor  of  specified  size  and  depth.  The  depth  of  a  typical  normal,  non-lactating  human  breast  is  on 
the  order  of  5  cm  (for  example,  see  [16]).  This  suggests  that  a  flattened  breast  of  a  patient  in  supine 
position  would  span  less  than  5  cm  between  the  skin  surface  and  the  rib  cage.  Further,  almost  50%  of 
all  breast  tumors  occur  in  the  quadrant  near  the  underarm  where  the  breast  is  less  than  about  2.5  cm 
deep  [17].  Accordingly,  we  have  based  our  computational  models  of  the  confocal  microwave  system  on 
detecting  tumors  to  depths  of  up  to  5  cm  with  a  typical  depth  of  3  -  4  cm. 

To  determine  the  dynamic  range  required  to  detect  a  tumor  of  a  specific  diameter  and  depth,  the 
peak-to-peak  amplitude  of  the  backscattered  response  of  the  tumor  is  compared  with  the  peak-to-peak 
amplitude  of  the  exciting  pulse.  In  each  simulation,  we  record  the  FDTD-computed  magnetic  field 
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tumor  depth 

tumor  diameter 

tumor  response 

3.0  cm 

5.28  mm 

-83  dB 

3.52  mm 

-88  dB 

1.76  mm 

-96  dB 

4.0  cm 

5.28  mm 

-92  dB 

3.52  mm 

-97  dB 

1.76  mm 

-106  dB 

5.0  cm 

5.28  mm 

-101  dB 

3.52  mm 

-106  dB 

1.76  mm 

-115  dB 

Table  2:  Normalized  backscatter  data  as  a  function  of  the  tumor  diameter  and  depth. 

circulating  the  voltage  source  at  the  feed  point,  since  this  field  is  proportional  to  the  induced  current. 
The  simulation  is  performed  for  tumor  diameters  of  5.28  mm,  3.52  mm,  and  1.76  mm  at  depths  of  3.0 
cm,  4.0  cm,  and  5.0  cm.  The  backscatter  response  levels  are  tabulated  in  Table  2.  The  peak  power  in 
the  backscatter  return  drops  approximately  9  dB  per  cm  increase  in  the  depth  of  the  tumor. 

The  backscatter  response  for  the  “worst  case”  tumor  studied  here  (diameter  of  1.76  mm,  depth  of 
5.0  cm)  is  seen  to  be  -115  dB  relative  to  the  source  power.  We  note  that  the  -125  dB  reverberation  due 
to  reflections  from  the  ends  of  the  resistively  loaded  bowtie  antenna,  reported  in  [13],  is  sufficiently  low 
enough  to  permit  backscattered  returns  from  this  tumor  to  be  sensed.  Furthermore,  we  have  observed 
a  dynamic  range  in  the  order  of  120  dB  for  the  Hewlett-Packard  HP8720D  vector  network  analyzer 
when  properly  configured  and  programmed  with  processing  times  adequate  for  preclinical  testing.  This 
can  be  improved  to  135  dB  with  minor  modifications.  Thus,  given  the  discussion  above,  our  optimized 
resistively  loaded  bowtie  antenna  in  combination  with  commercial  vector  network  analyzers  yield  more 
than  adequate  dynamic  range  for  our  tumor-detection  system. 

V.  Summary  and  Conclusions 

This  paper  presented  the  methodology  and  results  of  FDTD  simulations  of  an  UWB  microwave  sensor 
array  to  detect  and  image  early-stage  malignant  breast  tumors.  Our  2-D  FDTD  studies  of  the  S/C  ratio 
demonstrate  the  robustness  of  the  coherent-addition  process  relative  to  the  dielectric  properties  and 
complexity  of  the  breast.  We  conducted  3-D  FDTD  simulations  to  study  the  co-polarized  backscattered 
response  of  small  idealized  spherical  tumors  (diameters  less  than  or  equal  to  5  mm)  located  at  depths  up 
to  5.0  cm  within  the  homogeneous  breast.  The  results  reported  here  and  in  [4,  13]  indicate  that  a  system 
realized  with  existing  microwave  equipment  has  sufficient  sensitivity  and  dynamic  range  to  detect  small 
tumors  less  than  5  mm  in  diameter  located  within  5  cm  of  the  skin  surface.  Using  these  simulations  as 
a  design  tool,  we  are  constructing  a  sensor  system  for  pre-clinical  experimentation. 

VI.  Acknowledgments 

This  work  was  supported  in  part  by  the  Small  Business  Innovative  Research  Grant  1-R43-CA67598-D1A2 
from  the  National  Institutes  of  Health  and  internal  funding  from  Interstitial,  Inc.  Computing  resources 
were  provided  by  Cray  Research,  Inc.  We  gratefully  acknowledge  technical  guidance  from  Prof.  Alan 
Sahakian  of  Northwestern  University,  Dr.  Melvin  Griem  of  the  University  of  Chicago,  Dr.  John  Aarsvold 
of  the  Atlanta  VA  Medical  Center,  and  Dr.  Frederick  Kelcz  of  the  University  of  Wisconsin-Madison. 


680 


VII.  References 

[1]  S.  C.  Hagness,  A.  Taflove,  and  J.  E.  Bridges,  “FDTD  analysis  of  a  pulsed  microwave  confocal  system 
for  breast  cancer  detection,”  in  Proc.  Int.  Con f.  IEEE  Engineering  in  Medicine  and  Biology  Soc ., 
Chicago,  IL,  Oct.  1997,  pp.  2506-2508. 

[2]  S.  C.  Hagness,  A.  Taflove,  and  J.  E.  Bridges,  “FDTD  modeling  of  a  coherent-addition  antenna  array 
for  early-stage  detection  of  breast  cancer,”  in  IEEE  Antennas  and  Propagat.  Soc.  Int.  Symposium 
Digest,  Atlanta,  GA,  June  1998,  vol.  2,  pp.  1220-1223. 

[3]  M.  Popovic,  S.  C.  Hagness,  A.  Taflove,  and  J.  E.  Bridges,  “2-D  study  of  fixed-focus  elliptical  reflector 
system  for  breast  cancer  detection:  Frequency  window  for  optimum  operation,”  in  IEEE  Antennas 
and  Propagat.  Soc.  Int.  Symposium  Digest ,  Atlanta,  GA,  June  1998,  vol.  4,  pp.  1992-1995. 

[4]  S.  C.  Hagness,  A.  Taflove,  and  J.  E.  Bridges,  “Two-dimensional  FDTD  analysis  of  a  pulsed  microwave 
confocal  system  for  breast  cancer  detection:  Fixed-focus  and  antenna-array  sensors,”  IEEE  Trans. 
Biomed.  Eng.,  vol.  45,  no.  12,  pp.  1470-1479,  Dec.  1998. 

[5]  ANSI/IEEE  C95. 1-1992,  American  National  Standard  -  Safety  Levels  with  Respect  to  Human  Ex¬ 
posure  to  Radio  Frequency  Electromagnetic  Fields ,  3  kHz  to  300  GHz,  IEEE  Press,  New  York, 
1992. 

[6]  C.  Gabriel,  S.  Gabriel,  and  E.  Corthout,  “The  dielectric  properties  of  biological  tissues:  I.  Literature 
survey,”  Phys.  Med.  Biol.,  vol.  41,  no.  11,  pp.  2231-2249,  Nov.  1996. 

[7]  W.  T.  Joines,  Y.  Z.  Dhenxing,  and  R.  L.  Jirtle,  “The  measured  electrical  properties  of  normal  and 
malignant  human  tissues  from  50  to  900  MHz,”  Med.  Phys.,  vol.  21,  pp.  547-550,  Apr.  1994. 

[8]  S.  S.  Chaudhary,  R.  K.  Mishra,  A.  Swarup,  and  J.  M.  Thomas,  “Dielectric  properties  of  normal  and 
malignant  human  breast  tissues  at  radiowave  and  microwave  frequencies,”  Indian  J.  Biochem.  and 
Biophys.,  vol.  21,  pp.  76-79,  Feb.  1984. 

[9]  K.  S.  Kunz  and  R.  J.  Luebbers,  The  Finite  Difference  Time  Domain  Method  for  Electromagnetics, 
CRC  Press,  Boca  Raton,  FL,  1993. 

[10]  A.  Taflove,  Computational  Electrodynamics:  The  Finite-Difference  Time-Domain  Method ,  Artech 
House,  Boston,  MA,  1995. 

[11]  R.  M.  Joseph,  S.  C.  Hagness,  and  A.  Taflove,  “Direct  time  integration  of  Maxwell’s  equations  in  linear 
dispersive  media  with  absorption  for  scattering  and  propagation  of  femtosecond  electromagnetic 
pulses,”  Optics  Lett.,  vol.  16,  no.  18,  pp.  1412-1414,  1991. 

[12]  S.  C.  Hagness,  A.  Taflove,  and  J.  E.  Bridges,  “Wideband  ultralow  reverberation  antenna  for  biolog¬ 
ical  sensing,”  Electron.  Lett.,  vol.  33,  no.  19,  pp.  1594-1595,  1997. 

[13]  S.  C.  Hagness,  A.  Taflove,  and  J.  E.  Bridges,  “Three-dimensional  FDTD  analysis  of  a  pulsed 
microwave  confocal  system  for  breast  cancer  detection:  Design  of  an  antenna-array  element,”  IEEE 
Trans.  Antennas  Propagat.,  in  press. 

[14]  M.  Piket-May,  A.  Taflove,  and  J.  Baron,  “FD-TD  modeling  of  digital  signal  propagation  in  3-D 
circuits  with  passive  and  active  loads,”  IEEE  Trans.  Microwave  Theory  Tech.,  vol.  42,  no.  8,  pp. 
1514-1523,  Aug.  1994. 

[15]  J.-P.  Berenger,  “A  perfectly  matched  layer  for  the  absorption  of  electromagnetic  waves,”  J.  Comput. 
Phys.,  vol.  114,  no.  1,  pp.  185-200,  1994. 

[16]  X.  Wu,  G.  T.  Barnes,  and  D.  M.  Tucker,  “Spectral  dependence  of  glandular  tissue  dose  in  screen-film 
mammography,”  Radiology,  vol.  179,  no.  1,  pp.  143-148,  Apr.  1991. 

[17]  W.  H.  Parsons,  Cancer  of  the  Breast,  Charles  Thomas,  Springfield,  IL,  1959. 


681 


Numerical  Dosimetry  for  Human  Occupational  Exposure  to  Realistic 

60-Hz  Magnetic  Fields 


Trevor  W.  Dawson,  K.  Caputa  and  Maria  A.  Stuchly 
Dept.  Elec.  &  Comp.  Eng.,  University  of  Victoria 
PO  Box  3055  Stn  CSC  Victoria  BC  Canada  V8W  3P6 


Abstract 

Human  exposure  to  occupational  non-uniform  60-Hz  magnetic  fields  produced  by  realistic  three-phase  con¬ 
ductor  systems  is  evaluated  numerically.  Two  specific  scenarios  are  considered.  The  first  involves  a  seated 
worker  performing  cable  maintenance  in  an  underground  vault  with  conductors  carrying  500  A  (rms)  per  phase. 
The  second  involves  a  standing  worker  near  is^phase  buses  of  a  700  MW  generator,  with  conductors  carrying 
20  kA  (rms)  per  phase.  The  modeling  is  based  on  a  scalar  potential  finite  difference  (SPFD)  method,  applied 
to  a  high-resolution  (3.6  mm)  voxel  conductivity  model  of  the  human  body.  Good  correspondence  between  var¬ 
ious  external  field  measures  and  the  associated  induced  field  measures  in  various  organs  is  observed  The  lower 
currents  in  the  vault  conductors  produce  correspondingly  low  induced  current  densities,  with  tissue  average 
values  typically  below  0.2  mA  m-2.  Conversely,  the  generator  bus  currents  result  in  1.5-mT  fields  at  a  distance 
of  1.2  m.  The  resulting  organ-average  induced  current  densities  are  in  the  2-8  mA  m  range,  with  maximum 
values  above  10  mA  m-2.  A  comparison  with  uniform  field  exposures  indicates  that  induced  fields  in  organs 
can  be  reasonably  weU-estimated  from  the  accurately  computed  exposure  fields  averaged  over  the  organs  and 
the  organ  dosimetric  data  for  uniform  magnetic  fields.  The  nonuniform  field  exposures  generally  result  in  lower 
induced  fields  than  those  for  uniform  fields  of  the  same  intensity. 


1  Introduction 

While  the  issue  of  health  effects  of  60-Hz  magnetic  fields  remains  unresolved  [NIEHS,  1998],  there  are  established 
biological  interactions  for  fields  strong  enough  to  induce  tissue  current  densities  of  the  order  of  10  mA  m 
\ICNIRP,  1998].  This  level  is  used  as  a  benchmark  in  safety  guidelines  [ICNJRP ,  1998].  Human  exposure  to 
relatively  high  magnetic  flux  densities  most  often  occurs  in  occupational  settings.  Numerical  modeling  involving 
realistic  conductivity  representations  of  the  human  body  can  provide  a  useful  tool  for  investigating  the  connection 
between  the  external  field  and  the  induced  internal  fields.  Earlier  modeling  of  worker  live-line  exposures  has 
considered  mostly  high-voltage  transmission  lines  [Stuchly  and  Zhao,  1996;  Dawson  et  al.,  1998].  In  all  cases,  the 
current-carrying  conductors  have  been  represented  as  infinite  straight-line  sources.  Such  models  clearly  are  not 
appropriate  for  modeling  representative  exposures  to  complicated  conductor  systems  such  as  in  power  generating 
plants  or  substations.  The  present  work  extends  the  source  modeling  to  multi-phase  conductor  systems  composed 
of  rectilinear  elements,  including  finite  line  segments,  as  well  as  half-  and  full-lines  (to  model  current  systems  closed 
“at  infinity”).  This  expanded  source  treatment  is  used  in  conjunction  with  realistic  high-resolution  conductivity 
models  of  the  human  body  in  representative  postures.  Two  scenarios  are  considered.  The  first  scenario  pertains  to 
a  seated  worker  performing  maintenance  of  secondary  cables  and  bus  bars  in  an  underground  vault  m  a  substation 
This  is  a  three-phase  current  configuration  with  500-A  (rms)  per  phase.  The  source  model  involves  a  total  of 
9finite  segments  and  half-lines.  The  second  scenario  involves  an  upright  worker  inspecting  the  isophase  buses  of  a 
700  MW  generator.  There  are  three  conductor  systems  carrying  20  kA  per  phase,  with  the  currents  summing  to 
zero  at  the  star  point.  The  source  model  involves  11  elements,  either  finite  segments  or  half-lines. 

This  research  has  three  objectives.  The  first  is  a  realistic  evaluation  of  the  electric  fields  and  current  densities 
induced  in  various  organs  of  the  human  body  by  nonuniform  magnetic  fields  in  representative  electric  utility  work 
locations.  The  second  objective  is  a  comparison  of  the  resulting  nonuniform  dosimetry  to  recently  published  data 
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for  uniform  field  exposures,  e.g.  [Dawson  and  Stuchly,  1998;  Dawson  et  al.,  1997].  The  third  objective  is  to  consider 
induced  field  levels  in  light  of  various  quantitative  measures  of  the  source  magnetic  field.  The  latter  are  simpler  to 
compute.  Therefore,  it  would  be  useful  to  have  “rule-of-thumb”  estimates  of  the  true  induced  fields  obtained  from 
local  measures  of  the  non-uniform  source  field  combined  with  dosimetric  data  for  uniform  field  exposures 


2  Methods 

2.1  Body  Models 

All  computations  involve  heterogeneous  models  of  the  human  body  comprising  cubic  voxels  with  3.6-mm  edges.  The 
present  models  are  derived  from  the  basic  upright  configuration  used  in  earlier  uniform  field  calculations  [Dawson 
et  al.,  1997;  Dawson  and  Stuchly,  1998].  The  limbs  of  the  original  model  have  been  numerically  articulated  to 
obtain  the  required  postures.  Care  has  been  taken  to  maintain  electrical  continuity  of  bone,  muscle,  blood  vessels 
and  skin.  The  models  have  some  80  tissues  identified,  with  each  voxel  is  assigned  a  recently  measured  conductivity 
value  corresponding  to  its  dominant  tissue  type.  The  model  resolution  is  sufficiently  high  to  resolve  all  major  body 
components,  as  well  as  some  of  the  smaller  organs  and  tissues. 


2.2  Exposure  Scenarios 

The  two  scenarios  are  illustrated  in  Figures  1  .  Panels  (a)  and  (b)  show  the  arrangement  of  the  body  model  and  the 
9  current-carrying  elements  in  the  vault  maintenance  scenario.  All  conductors  are  spaced  5.08  cm  on-centre,  with 
one  cable  pair  and  bus-bar  for  each  conductor  (and  phase).  The  lower  and  upper  horizontal  cables  are  respectively 
1.05  m  and  2.1  m  above  the  floor.  The  bus-bars  are  0.52  m  apart.  Each  phase  carries  500  A  (rms).  The  workers 
midplane  is  approximately  0.3  m  to  the  right  of  the  nearest  bus-bar.  Inspection  of  the  isophase  buses  of  a  700-MW 
generator  are  shown  in  panel  (c)  of  Fig.  1.  Two  overhead  horizontal  segments  within  the  neutral  box  meet  at 
the  star  point,  and  3  angled  segments  run  to  the  generator  (which  is  itself  not  modeled).  Three  vertical  segments 
run  from  generator  to  lead-box,  and  thence  three  half-lines  run  to  a  remote  transformer.  There  are  a  total  of  11 
conductors,  carrying  20  kA  (rms)  per  phase.  The  lower  and  upper  horizontal  conductors  are  respectively  1.2  m 
and  3  m  above  the  ground.  The  conductors  are  0.9  m  apart.  The  worker  is  approximately  1.2  m  from  the  closest 
(vertical  right)  conductor  in  the  lead  box. 

It  is  assumed  that  any  protective  suit  is  completely  permeable  to  the  60-Hz  source  magnetic  field.  Any  con¬ 
ductors  other  than  the  the  human  body  and  the  primary  magnetic  field  sources  described  above  are  neglected.  All 
source  conductors  are  assumed  to  be  infinitely  thin  in  the  transverse  dimension.  These  simplifying  assumptions 
introduce  negligible  errors  in  modeling  of  the  actual  exposure  situations. 


2.3  Computational  Method 

The  numerical  modeling  is  based  on  a  representation  which  reflects  the  quasistatic  approximation  (Dawson  et  al., 
1997),  namely 

Ei(r)  =  -ju{A0(r)  +  Vil>(r)}.  (1) 

for  the  internal  electric  field,  El{r).  Here  ip(r)  is  a  scalar  potential,  w  =  2 irf  denotes  the  radian  frequency,  and  / 
(=60  Hz)  is  the  frequency  used  in  the  modeling.  The  associated  period  is  T  =  1/f.  The  applied  magnetic  field  is 
described  by  a  vector  potential,  with  ,  Bq(t)  =  V  x  Ao(t).  A  common  time-factor  e+jwt  is  suppressed  throughout. 
With  <x(r)  denoting  the  body  conductivity  distribution,  the  scalar  potential  must  satisfy  the  differential  equation 

V-  [rr(r)  Vip(r)\  =  -V-  [<r(r)  j40(r)]  (2) 

subject  to  the  boundary  condition  of  tangential  current  flow  at  the  surface.  These  equations  are  valid  for  low 
frequencies  in  the  absence  of  any  external  applied  electric  field. 
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Figure  1:  Two  views  (a,  b)  of  the  underground  vault  maintenance  scenario,  and  one  (c)  of  the 
generator  inspection  scenario. 


In  polar  coordinates  (p,  <p,  z )  and  using  the  Biot-Savart  law  [7],  a  current  i0  flowing  along  the  z-axis  from  *  - 
to  z  =  j3  may  be  interpreted  as  contributing  a  partial  magnetic  field 

Ho{r\a,0)  =  Ko^^-<^^  where  K°  =  (^)  and  Ar  =  +  (* "  T)3-  (3) 

This  partial  field  is  non-physical  since  it  is  not  divergence-free,  but  must  form  part  of  a  larger  closed  current 
system.  It  has  well-defined  half-line  limits  Jf0(r; -oo,/?)  and  if0(r;a, +oo),  as  well  as  the  elementary  full-line 
limit  Ho(r;  -oo,+oo)  =  2K0<p/p. 

It  may  be  verified  that  suitable  half-line  vector  potentials  are 

A0 (r;  a,  +oo)  =  -z  K0  In  [Ra  -  (z  -  a)] .  and  A0 (r;  -oo,  /?)  =  -z  K0  In  [R0  +  (z  -  0)]  -  (4) 

Suitable  vector  potentials  for  the  whole  or  finite  segments  [a,  0\  of  the  z-axis  are  then  given  by 

A0(r;-oo,+oo)  =  A0(r;-oo,7)  +  ^4o(T’;7)+'»)  =  -zKq  In  (p2)  .  (5) 

11  Q  Ao(r\a,0)  =  A0(r;a,+oo)  -  Ao(r;0,+  oo)  =  A0(r;  -oo,/?)  -  A0(r;  -oo,a)  (6) 

Magnetic  field  and  vector  potential  expressions  for  arbitrarily  oriented  elements  are  easily  derived  from  the  above 
using  translation  and  rotation  of  the  coordinates. 
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To  account  for  the  three-phase  source  current,  each  field  quantity  is  represented  the  complex  spatial  vector 
form  C(r)  =  Cr  (r)  +  jci  (r),  having  an  associated  real  field, 

c  (r,  t)  =  5ft  {C  ( r )  eJU't}  =  (r)  cos(wf)  -  c4  (r)  sin(wf),  (7) 

which  is  the  quantity  of  physical  interest.  It  can  be  shown  that  the  temporal  minimum,  maximum  and  root-mean- 
square  (over  one  period)  field  amplitudes  at  the  point  r  are 

c<  (r)  =  ominr ||c (r,  t) ||  =  y/\pc  (r)  -  qc  (r)]  /2,  c>  (r)  =  mex, ||c (r,i)||  =  Vbc  (r)  +  qc  (p)J  /2,  (8) 

and 

Cfl  (r)  =  ]j^Jo  l|c(r,t)|t2dt  =  where  pe  (r)  s  \\C  (r)||2  and  qc  (r)  -  j C (r)  C (r)|  -  (9) 

The  numerical  solution  of  the  boundary  value  problem  is  described  elsewhere  (Dawson  et  al,  1997).  The  only 
substantial  differences  between  the  earlier  uniform  field  calculations  and  the  present  ones  are  the  choice  of  applied 
magnetic  vector  potential  to  model  the  nonuniform  fields,  and  the  allowance  for  multiphase  sources.  Since  the 
boundary  value  problem  has  real  coefficients,  the  excitation  due  to  the  in-phase  and  quadrature  components  of  the 
source  can  be  computed  separately.  The  resulting  solutions  can  then  be  combined  at  the  post-processing  stage  to 
yield  the  complex  induced  field. 


3  Results  and  Discussion 

The  evaluation  of  a  given  configuration  requires  computation  of  approximately  1.7  x  106  electric  field  and  associated 
current  density  vectors,  distributed  over  approximately  80  elementary  tissues.  The  temporal  behaviour  at  a 
single  point  can  be  characterized  by  the  minimum,  maximum  and  root-mean-square  values  over  one  period.  To 
describe  the  spatial  behaviour,  the  elementary  tissues  are  aggregated  into  composite  tissue  groups  (e.g.  organs) 
as  appropriate.  Scalar  spatial  dosimetric  data  of  either  a  local  (minimum  and  maximum)  or  a  global  (average, 
root-mean  square)  nature  can  then  be  computed  for  each  tissue  group.  The  two  global  measures  for  a  spatial 
vector  field  c«  (r)  are  defined  analytically  as 

Avg {c»  (r)}  =  l^||c.  (r)j|  dV ,  and  Rms{c,  (r)}  =  j|c,  (r)\\2dV.  (10) 

The  integrations  are  taken  over  the  full  volume  of  the  particular  organ  under  consideration,  and  are  approximated 
by  discrete  sums  over  the  appropriate  voxel  data. 

3.1  External  field  measures 

Table  1  shows  the  four  whole-body  volumetric  measures  of  the  external  field  based  on  temporal  rms  values.  The 
Table  1:  Whole-body  external  field  measures  (mT) 


Min. 

Max. 

Avg. 

Rms. 

Vault 

0.01 

3.67 

0.05 

0.15 

Generator 

0.71 

2.47 

1.50 

1.55 

table  indicates  that  the  vault  scenario  exposes  the  body  to  the  highest  peak  fields,  due  to  the  close  proximity  of  the 
hands  to  the  conductors.  However,  conductors  in  the  generator  scenario  expose  the  worker  to  significantly  higher 
whole-body  fields,  as  indicated  by  the  average  and  spatial  rms  values.  Also,  this  latter  scenario  has  a  much  higher 
whole-body  minimum  source  field.  The  significant  discrepancy  of  the  spatial  average  and  rms  measures  for  the 
vault  scenario  is  indicative  of  a  high  degree  of  field  nonuniformity,  largely  due  to  the  close  proximity  of  the  body 
to  the  conductors.  The  source  field  is  more  uniform  over  the  body  in  the  generator  scenario,  as  indicated  by  the 
similar  spatial  average  and  rms  measures. 
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3.2  Induced  Field  Dosimetry 

Table  2  summarizes  the  spatial  maximum,  average  and  rms  induced  electric  field  (columns  2-4)  and  current  density 


Table  2:  Selected  tissue  dosimetry  for  the  vault  maintenance  scenaria 


ll-EII  (mV  m-i) 

IIJII  (mAm  -) 

1  !•» II  ) 

Max 

Avg 

Rms 

Max 

Avg 

Rms 

Min 

Max 

Rms 

blood 

1.66 

0.25 

0.31 

1.16 

0.17 

0.21 

0.013 

0.708 

0.053 

marrow 

9.95 

0.41 

0.63 

0.50 

0.02 

0.03 

0.011 

1.125 

0.100 

brain 

3.99 

0.57 

0.65 

0.40 

0.05 

0.06 

0.039 

0.065 

0.050 

1.45 

0.33 

0.40 

2.90 

0.66 

0.80 

0.039 

0.065 

0.049 

heart 

2.22 

0.64 

0.72 

0.22 

0.06 

0.07 

0.032 

0.044 

0.038 

kidneys 

lungs 

1.44 

0.37 

0.42 

0.14 

0.04 

0.04 

0.020 

0.030 

0.025 

2.07 

0.46 

0.54 

0.17 

0.04 

0.04 

0.023 

0.048 

0.035 

muscle 

11.09 

0.41 

0.56 

3.88 

0.14 

0.20 

0.010 

2.630 

0.111 

prostate 

spleen 

testes 

0.58 

0.23 

0.25 

0.23 

0.09 

0.10 

0.016 

0.018 

0.017 

1.20 

0.45 

0.49 

0.12 

0.05 

0.05 

0.028 

0.037 

0.032 

0.83 

0.29 

0.33 

0.33 

0.12 

0.13 

0.017 

0.020 

0.018 

whole  body 

25.03 

0.55 

0.93 

3.88 

0.09 

0.15 

0.010 

3.665 

0.147 

(columns  5-7)  dosimetry  in  selected  organs  for  the  vault  scenario,  based  on  the  temporal  rms  fields  (9a).  Associated 
spatial  minimum,  maximum  and  rms  values  of  the  source  magnetic  field  are  also  shown  in  columns  8  through  10. 

The  dosimetric  data  are  largely  consistent  with  the  external  field  measures;  namely,  tissues  that  are  close  to  the 
current-carrying  conductors  (bone  and  muscle,  whole  body)  have  high  maximum  induced  electric  field.  These  also 
have  disparate  values  of  the  spatial  average  and  rms  measures,  indicative  of  a  high  degree  of  source  nonuniformity. 
Other  organs  have  more  similar  average  and  rms  spatial  measures.  The  electric  fields  measures  are  generally  m 
the  0  2-0  5  mV  m"1  range.  Induced  current  densities  are  generally  below  1  mA  m  ,  except  for  a  few  maximum 
values  associated  with  tissues  close  to  the  conductors  or  having  high  conductivity  values.  However,  all  average 
and  rms  values  are  less  than  0.2  mA  m"2,  with  the  exception  of  cerebrospinal  fluid  (csf).  Thus,  this  particular 
occupational  exposure,  when  accurately  modeled,  is  associated  with  induced  current  densities  that  are  well  below 
the  10  mA  m"2  threshold  stipulated  in  a  recent  guideline  [ICNIRP ,  1998]. 

Table  3  presents  analogous  data  for  the  generator  scenario.  The  dosimetric  data  are  again  generally  consistent 


Table  3: 

Selected  tissue  dosimetry  for  the  generator  inspection  scenario. 

||£l||  (mV  m 

-1) 

|jj||  (mA  m' 

-2) 

\\B\\  (mT) 

Max 

Avg 

Rms 

Max 

Avg 

Rms 

Min 

Max 

Rms 

blood 

100.0 

12.7 

15.0 

70.04 

8.89 

10.50 

0.78 

2.43 

1.67 

marrow 

149.6 

19.9 

26.1 

7.48 

0.99 

1.30 

0.74 

2.10 

1.40 

brain 

155.7 

24.6 

27.7 

15.57 

2.08 

2.43 

2.03 

2.44 

2.23 

csf 

55.6 

13.7 

16.2 

111.20 

27.35 

32.44 

2.03 

2.44 

2.22 

heart 

76.8 

22.2 

25.3 

7.68 

2.22 

2.53 

1.77 

1.95 

1.86 

kidneys 

120.1 

30.4 

34.0 

12.01 

3.04 

3.40 

1.49 

1.71 

1.61 

lungs 

136.2 

29.8 

33.9 

10.89 

2.38 

2.71 

1.56 

2.00 

1.81 

muscle 

183.2 

19.8 

24.4 

64.12 

6.93 

8.54 

0.71 

2.41 

1.47 

prostate 

48.6 

17.4 

18.5 

19.44 

6.96 

7.41 

1.35 

1.41 

1.38 

spleen 

120.9 

56.7 

58.5 

12.09 

5.67 

5.85 

1.66 

1.86 

1.75 

testes 

67.4 

18.1 

20.6 

26.97 

7.22 

8.24 

1.35 

1.48 

1.40 

whole  body 

523.2 

26.3 

33.7 

111.20 

4.56 

6.59 

0.71 

2.47 

1.55 

with  the  exposure  field  measures.  There  are  smaller  differences  between  the  average  and  rms  fields,  indicative  of  a 
more  uniform  external  field  over  the  body  in  comparison  with  the  vault  scenario.  On  the  other  hand,  the  relatively 
higher  source  field  levels  produce  higher  induced  quantities. 
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In  many  organs  the  average  electric  fields  are  of  the  order  of  20  mV  m-1.  Only  in  a  few  cases  do  the  spatial 
average  or  rms  induced  current  densities  exceed  10  mA  m-2.  The  limit  of  10  mA  m~2  is  exceeded  by  the  maximum 
values  in  several  tissues.  It  may  be  noted  than  the  maximum  values  are  overestimated  by  perhaps  10-25%  due 
to  the  staircasing  inherent  in  the  human  body  voxel  model.  On  the  other  hand,  this  scenario  assumes  a  1.2-m 
body/conductor  separation;  significantly  higher  fields  will  be  induced  at  closer  separations. 


3.3  Comparison  with  Uniform  Field  Exposure 

Table  4  presents  normalized  values  of  the  spatial  average  (E^)  and  maximum  (E>)  induced  electric  fields  in  selected 

Table  4:  Selected  tissue  dosimetry,  normalized  to  the  whole-body  average  applied  magnetic  field,  for 
the  vault  maintenance  and  generator  inspection  scenarios,  and  for  a  uniform  field  exposure. 


Vault 

Generator 

Uniform 

B, 

E„  E> 
(V/m/T) 

B „ 

K  E> 
(V/m/T) 

K 

K  e> 

(V/m/T) 

blood 

0.7 

4.9 

32.8 

1.1 

8.4 

66.5 

1.0 

8.2 

81.8 

marrow 

0.8 

8.0 

196.5 

0.9 

13.2 

99.4 

1.0 

16.1 

152.3 

brain 

1.0 

11.3 

78.9 

1.5 

16.4 

103.5 

1.0 

11.7 

72.8 

csf 

1.0 

6.5 

28.7 

1.5 

9.1 

36.9 

1.0 

7.5 

38.7 

heart 

0.7 

12.7 

43.9 

1.2 

14.8 

51.0 

1.0 

19.2 

68.3 

kidneys 

0.5 

7.3 

28.4 

1.1 

20.2 

79.8 

1.0 

25.2 

72.0 

lungs 

0.7 

9.0 

41.0 

1.2 

19.8 

90.5 

1.0 

20.8 

85.6 

muscle 

0.9 

8.1 

219.0 

1.0 

13.2 

121.8 

1.0 

14.9 

144.8 

prostate 

0.3 

4.6 

11.5 

0.9 

11.6 

32.3 

1.0 

17.0 

53.6 

spleen 

0.6 

8.9 

23.7 

1.2 

37.7 

80.3 

1.0 

41.2 

91.2 

testes 

0.4 

5.7 

16.3 

0.9 

12.0 

44.8 

1.0 

14.8 

73.5 

whole  body 

1.0 

10.8 

494.3 

1.0 

17.5 

347.7 

1.0 

18.9 

368.8 

tissues  for  the  present  vault  and  generator  scenarios,  as  well  as  for  corresponding  values  taken  from  earlier  uniform- 
field  exposure  [ Dawson  and  Stuchly,  1998].  The  data  are  normalized  by  the  external  magnetic  field  whole-body 
averages  (Table  1),  and  so  have  units  of  V  m"1  T-1.  The  column  headed  indicates  the  associated  normalized 
values  of  the  organ-average  source  magnetic  field;  these  data  are  therefore  dimensionless  and  represent  fractions  of 
the  whole  body  average. 

The  following  observations  can  be  made  based  on  these  data:  (i)  The  average  induced  electric  fields  in  all 
organs,  except  for  the  brain  and  csf  in  the  generator  scenario,  are  lower  than  those  for  the  comparable  uniform 
field  exposure,  and  (ii)  The  rms  magnetic  field  within  the  brain  in  the  generator  scenario  (2.23  mT,  Table  3) 
is  50%  higher  than  the  whole-body  value  (1.55  mT),  and  is  associated  with  an  average  induced  electric  field  of 
16.4  V  m-1  T-1.  This  value  may  be  compared  with  an  average  electric  field  of  17.6  mV  m-1  that  would  be 
produced  by  a  uniform  1.5-mT  magnetic  field  (Table  4). 

These  results  have  a  useful  practical  application.  For  nonuniform  magnetic  field  exposures,  the  expected 
average  induced  fields  in  a  given  organ  may  be  estimated  using  the  associated  organ-average  value  of  the  non- 
uniform  magnetic  field  in  conjunction  with  the  previously-computed  fields  induced  by  a  uniform  magnetic  field  of 
the  same  strength. 

The  true  values  induced  in  organs  obtained  from  full  non-uniform  source  computation  are  not  likely  to  exceed 
the  estimated  values.  Exceptional  cases  may  be  associated  with  a  part  of  a  large  organ  being  in  a  much  stronger 
field  than  the  average. 
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4  Conclusions 

Human  exposure  to  nonuniform  magnetic  fields  from  spatially  complex  conductor  configurations  has  been  modeled 
using  a  scalar  potential  finite  difference  (SPFD)  computer  code  with  proper  modifications.  Dosimetric  data  m 
terms  of  various  spatial  measures  of  the  induced  electric  fields  and  current  densities  in  a  high-resolution  model  of 
the  human  body  have  been  computed  for  two  realistic  exposure  scenarios. 

The  vault  maintenance  scenario  is  associated  with  induced  current  densities  that  are  well  below  a  10  mA  m“2 
threshold.  For  the  generator  scenario,  on  the  other  hand,  spatial  average,  rms  or  maximum  values  in  several  organs 
exceed  this  threshold. 

A  comparison  of  the  full  non-uniform  source  dosimetric  data  with  equivalent  uniform  magnetic  exposure  values 
shows  that  the  average  induced  fields  and  currents  in  organs  are  generally  smaller  for  the  non-uniform  exposures. 
For  organs  that  are  exposed  to  fields  significantly  stronger  than  the  whole-body  average  value,  the  average  field 
within  the  volume  of  the  organ  should  be  considered.  Higher  maximum  values  of  the  induced  fields  occur  in 
tissues  that  are  in  magnetic  fields  much  stronger  than  the  rest  of  the  body,  e.g.  in  tissues  of  the  hands  close  to 
conductors.  Overall,  these  comparisons  suggest  the  possibility  of  using  the  dosimetry  data  for  uniform  magnetic 
fields  together  with  accurate  spatial  maps  of  the  non-uniform  exposure  field  within  the  body  volume  for  reasonably 
reliable  estimation  of  the  organ  and  tissue  induced  electric  fields  and  currents  under  non-uniform  field  conditions. 
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Characterization  of  Near  Electromagnetic  Fields 
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Abstract 

This  paper  presents  characterization  of  near  field  distributions  radiated  from  hand-held  communication 
devices  calculated  with  the  Numerical  Electromagnetic  Code1.  A  wire-grid  model  of  a  hand-held 
cellular  telephone  was  developed  and  validated  with  extensive  measurements.  This  wire-grid  model 
was  then  used  to  examine  four  antenna  types  and  the  differences  in  electromagnetic  radiation  exposure 
based  on  electric  field  intensity  in  the  projected  location  of  the  head  and  hand.  The  four  candidate 
antennas  are  the  quarter-wave  monopole,  half-wave  dipole,  inverted-F  antenna,  and  a  half-loop 
antenna. 

The  results  indicate  that  all  candidate  antennas  had  localized  areas  where  the  IEEE  Maximum 
Permissible  Exposure  (MPE)  standard  for  total  body  immersion  are  exceeded.  Of  the  four  antennas 
examined,  the  half-wave  dipole  had  minimum  exceeded  area  in  the  projected  head  space.  The  half¬ 
wave  dipole  also  excites  less  currents  on  the  hand-held  shell,  allowing  the  hand  to  grasp  the  hand-held 
with  minimum  absorption.  Therefore,  the  maximum  useable  power  is  radiated  from  a  half-wave 
dipole  antenna  due  to  minimum  power  absorbed  by  the  head  or  hand. 

I.  Introduction 

Hand-held  communication  devices  have  been  at  the  research  forefront  with  their  wide  spread  use  and 
concern  of  possible  health  risk  from  electromagnetic  absorption.  An  early  paper  points  out  that  the 
human  body  has  different  specific  absorption  rates  (SAR)  along  its  surface  with  the  highest  levels 
around  the  head  and  the  neck2.  This  work  was  extended  by  research  into  SAR  as  a  function  of  hand¬ 
held  proximity  to  the  head3  and  head  shape4.  Electromagnetic  energy  radiated  by  a  hand-held  radio 
integral  antenna  is  a  function  of  antenna  and  radio  shape,  excitation  point,  antenna  attachment,  and 
scatterers  and  absorbers  in  the  near  field.  This  paper  focuses  on  near  field  electromagnetic  intensity  as 
a  function  the  antenna  type,  location,  and  size.  Four  antennas  are  analyzed  for  use  with  mobile  cellular 
telephone  systems. 

The  typical  antenna  used  with  a  hand-held  cellular  radio  is  an  integral  monopole.  Hand-held  radios  are 
usually  supported  by  a  hand  when  in  use.  The  integral  monopole  creates  currents  on  the  hand-held 
shell,  the  counterpoise.  The  grasping  hand  has  a  profound  effect  on  the  radio's  communication 
efficiency  due  to  impeding  and  altering  surface  current's  direction  and  magnitude.  In  fact,  the 
interaction  of  the  head,  hand,  and  body  have  several  effects  that  require  investigation:  1)  radiation 
efficiency,  2)  radiation  pattern  distortion,  3)  compliance  with  standards,  and  4)  specific  absorption  rate. 

This  paper  examines  alternative  antenna  types  from  the  view  point  of  electromagnetic  radiation  in  the 
projected  head  space.  The  first  goal  is  to  construct  a  wire-grid  numerical  model  and  validate  with 
empirical  data,  and  then  use  the  numerical  model  to  determine  the  ideal  antenna  type  and  mounting 
location.  Four  simple  antennas  were  compared:  quarter-wave  monopoie,  haif-wave  dipoie,  inverted-F, 
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and  a  half-loop.  This  paper  determines  the  electromagnetic  field  strengths  close  to  these  simple 
antenna  structures  and  compares  the  values  to  the  IEEE  RF  safety  standard5. 

II.  Maximum  Permissible  Exposure 

According  to  the  IEEE  RF  safety  standard,  safety  to  personnel  from  electromagnetic  fields  and  contact 
with  conductive  objects  exposed  to  such  fields  is  a  function  of  body  resistance  and  capacitance  to 
ground5.  The  work  performed  was  confined  to  measurements  and  numerical  modeling  of  the 
electromagnetic  fields  associated  with  small  wireless  systems  and  it  did  not  address  thresholds  on 
health  hazards.  More  specifically,  we  were  attempting  to  determine  by  empirical  and  numerical 
models  of  simple  antennas  whether  the  set  standards5  were  exceeded 

The  current  IEEE  standard  provides  the  Maximum  Permissible  Exposure  (MPE)  in  terms  of  rms 
electric  (E)  and  magnetic  (H)  field  strengths,  power  density  (S),  and  induced  currents  (I)  in  the  human 
body.  The  MPE  to  electric  fields,  magnetic  fields  and  power  density  shown  in  Figure  1  are  for  the 
equivalent  vertical  cross-sectional  area  of  the  human  body.  Two  sets  of  curves  are  displayed: 
controlled  (people  aware  of  potential  exposure)  and  uncontrolled  (people  unaware  of  potential 
exposure).  These  MPE  field  levels  are  set  to  limit  the  maximum  surface  current  over  the  body  to  100 
mA  through  each  foot  to  ground.  Over  a  six  minute  time  frame,  this  results  in  about  144  J/kg  in  an 
average  68  kg  person.  This  is  equivalent  to  a  specific  absorption  ratio  (SAR)  of  about  0.4  W/kg 
averaged  over  the  entire  body  mass  or  spatial  peak  of  8  W/kg  averaged  over  1  gram  of  tissue,  cubic  in 
shape. 

The  power  density  is 
calculated  from  the  electric 
and  magnetic  radiated  fields. 

In  the  far  field,  several 
wavelengths  from  the 
antenna,  the  ratio  of  the 
electric  field  to  magnetic 
field  is  equal  to  the  intrinsic 
impedance  of  free  space, 

120a  Q.  In  the  near  field, 
this  relationship  is  not  true, 
and  it  is  difficult  to  calculate 
the  MPE  from  an  individual 
electric  or  magnetic  field, 
therefore  both  must  be 
measured  separately.  The 
power  density  in  watts  per 
square  meter  is  equal  to  the 
electric  field  in  volts  per 
meter  multiplied  by  the  magnetic  field  in  amps  per  meter.  However,  the  IEEE  standard  allows 
freedom  above  300  MHz  and  requires  only  one  field  to  be  measured,  generally  the  electric,  and  then 
the  free-space  equivalent  power  densities  (S)can  be  computed,  equation  1, 


E 

> 


S  10.00 


10  100 
Frequency  (MHz) 


Figure  1.  Maximum  Permissible  Exposure  Standard  for  RMS 
Electromagnetic  Field  Strengths  and  Power  Density  from  1  to  1000  MHz 
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(1) 


S(E)  =  and  S(H)  =  H2  *1 20 x  for  frequencies  >300  MHz 

These  MPE  power  densities  are  for  the  equivalent  vertical  cross-sectional  area  of  the  human  body. 
Permitted  exposure  levels  apply  to  all  parts  of  the  body  except  the  extremities.  Arms,  hands,  legs,  and 
feet  contain  no  vital  organs  and  provide  more  surface  area  that  enhances  cooling.  For  this  reason, 
extremities  can  be  exposed  to  up  20  W/kg  averaged  over  any  10  gram  cube  of  tissue.  There  is  an 
exclusion  in  the  IEEE  safety  standard  for  low-powered  devices.  The  MPE  may  be  exceeded  if  the 
radiated  power  is  less  than  7*450//(MHz)  (controlled)  or  1.4*450//(MHz)  (uncontrolled)  watts. 
However,  this  exclusion  does  not  apply  if  the  antenna  is  within  2.5  cm  from  the  body. 

HI.  Model  Development  and  Validation 

The  Numerical  Electromagnetic  Code  Version  41  (NEC-4)  is  a  well-known  implementation  of  the 
moment  method6  algorithm.  NEC-4  is  considered  the  best  method  of 
moments  electromagnetic  modeling  tool,  having  achieved  this  status  through 
constant  improvement  and  extensive  validation  by  many  independent  users. 

NEC-4  computes  the  current  distribution  on  a  segmented  wire-grid  or  surface 
patch  model  of  the  antenna  and  platform  structures.  The  wire-grid  models 
were  created  using  an  interactive  graphical  user  interface,  EAMrNEC7.  The 
current  distribution  in  each  wire-grid  segment  is  then  used  to  compute  near 
electric  and  magnetic  field  strengths. 

A  numerical  model  was  developed  for  a  plastic  coated  hand-held  cellular 
radio  with  integral  monopole  antenna.  Figure  2  displays  the  wire-grid  model 
that  was  used  to  compute  the  electromagnetic  fields.  The  wire-grid  model 
contains  117  wire  segments  to  model  the  aluminum  back  plane,  LCD/speaker 
enclosure,  antenna,  and  battery.  The  model  was  segmented  so  that  there  are 
approximately  15  segments  per  wavelength  at  850  MHz.  The  model  is 
situated  such  that  the  11.0  cm  monopole  is  aligned  with  the  z-axis  in  the 
Cartesian  coordinate  system.  For  validation  purposes,  the  near  electric  field 
was  calculated  every  10  cm  from  -1 .0  to  1 .0  m  in  the  x-  and  y-axis  and  from 
1.0  to  2.0  m  in  z-axis  creating  two  orthogonal  planar  views  that  bisect  at  the 
hand-held. 

For  accurate  comparison  to  the  model,  the  cellular  phone  was  situated  in  a 
large  open  area  free  of  metallic  obstacles  at  a  height  of  150  cm.  The  hand¬ 
held’s  vertical  dimension  was  15  cm  and  the  antenna  was  11  cm  long. 

Measurements  were  performed  in  linear  paths  on  the  axis  away  from  the 
radio  centered  on  the  antenna  at  110,  130,  150,  160,  and  170  cm  high.  The 
electric  field  strength  was  measured  using  a  small  3 -axis  probe8.  The  probe 
was  connected  to  the  readout  by  a  length  of  fiber  optic  cable  to  minimize 
measurement  interaction.  The  readout  provided  the  summation  of  all  three 
vector  components,  therefore  minimizing  fluctuations  due  to  probe 
orientation. 
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distance,  empirical 

Figure  4  displays  the  empirical  data  in  a  different  format,  orthogonal  planar  views  of  electric  field 
strengths  as  amplitude  contours.  The  left  hand  side  is  the  y-z  plane  slicing  through  the  hand-held  from 
back-to-front.  The  right  hand  side  is  the  x-z  plane  slicing  through  from  left-to-right.  The  largest 
electric  field  occurs  near  the  radio  between  a  height  of  150  and  165  cm.  The  measurement  resolution 
was  10  cm.  Closer  to  the  radio  and  antenna  the  values  had  a  steep  gradient  similar  to  10  cm  to  40  cm. 
At  this  reduced  power  level  it  is  estimated  that  the  peak  electric  field  reaches  100  V/m. 


O 


Figure  4.  Measured  Electric  Field  From  Cellular  Phone  in  Two  Orthogonal  Planes 
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The  same  measurements  were  performed  with  a  subject’s  hand  around  the  phone  and  the  left  ear 
against  the  hand-held.  Figure  5  displays  the  orthogonal  planar  views.  It  was  impossible  to  measure 
the  fields  within  the  subject.  Therefore,  the  fields  are  displayed  as  zero.  If  we  look  at  the  contour  area 
edges  where  the  field  drops  below  3  V/m  (dark  to  light  gray),  the  contour  overall  size  is  reduced  when 
the  subject  is  present.  The  body  interacts  with  the  test  object’s  radiated  power  and  absorbs  power 
resulting  in  reduced  electric  field  values.  The  exact  amount  of  absorbed  power  by  the  body,  head,  and 
hand  is  difficult  to  measure  without  complete  2%  steradians  of  measurements,  or  measuring  the  surface 
current  induced  onto  the  body. 


c;  r.  ©  *- 


Figure  5.  Measured  Electric  Field  From  Cellular  Phone  in  Two  Orthogonal  Planes  with  Head  and  Hand 


Comparison  to  the  safety  standard  must  be  performed  without  the  test  subject  present.  Once  the 
subject  is  present,  the  subject  absorbs  power  resulting  in  lower  values. 

Papers  to  date2’3,4,9  have  analyzed  the  effect  internal  to  the  head  and  not  the  root  cause.  With  a 
validated  wire-grid  model,  the  next  section  will  investigate  different  antennas  and  their  near  electric 
field,  in  an  effort  to  reduce  the  head  and  hand  exposure. 

IV.  Antenna  Comparisons 

Four  alternative  antennas  were  analyzed  for  their  applicability  to  hand-held  usage  from  the  perspective 
of  electromagnetic  radiation  in  the  projected  head  space:  quarter-wave  monopole,  half-wave  dipole, 
inverted-F,  and  naif-loop.  Each  antenna  was  numerically  constructed  on  the  validated  wire-grid 
model.  Then  the  numerical  models  were  used  to  determine  the  electromagnetic  near  field  generated  by 
the  hand-held/antenna  structures  and  these  predicted  values  were  compared  with  the  IEEE  RF  safety 
standard.  To  analyze  near  the  projected  head  area,  the  near  fields  were  predicted  in  a  centimeter  grid 
that  covered  a  planar  area  that  encircled  the  hand-held  radio  and  antenna. 

The  near  electric  field  around  a  quarter-wave  monopole  is  shown  in  Figure  6.  The  hand-held  shell  and 
antenna  are  drawn  as  white  rectangles  in  the  correct  location.  The  white  oval  outline  represents  the 
projected  head  space.  The  power  level  is  now  adjusted  to  the  nominal  value  of  600  milliwatts  at  the 
antenna  port.  Electromagnetic  field  peaks  occur  in  three  locations:  the  antenna  tip,  antenna/hand-held 
interface,  and  the  hand-held  shell’s  base.  When  the  head  is  placed  near  the  hand-held,  it  is  exposed  to 
localized  electromagnetic  fields  that  exceed  the  IEEE  standard  for  total  body  exposure  of  100  V/m  (45 
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V/m)  controlled  (uncontrolled).  Within  this 
plane,  the  projected  head  space  encircles  32 
sq.  cm  that  exceed  100  V/m.  Approximately 
45%9  of  the  radiated  power  is  absorbed  by  the 
head,  resulting  in  a  SAR  of  2.93  W/Kg9.  The 
head  area  exposed  includes  skin,  muscle, 
bone,  and  gray  matter.  When  only  a  fraction 
of  the  body  is  exposed,  localized  SAR  can 
reach  8  W/kg  averaged  over  one  gram  of  body 
mass  (1.6  W/kg  uncontrolled).  This  equates 
to  approximately  275  V/m  (125  V/m)  for 
controlled  (uncontrolled)  localized  electric 
field  levels. 

An  undesirable  feature  seen  in  Figure  6,  is  the 
amount  of  electric  field  radiating  directly 
from  the  hand-held  shell.  The  electric  field 
lowest  level  is  above  50  V/m.  When  a  hand 
grasps  the  hand-held  shell,  surface  currents 
flow  over  the  hand  and  power  is  absorbed. 
For  the  monopole,  approximately  10%9  of  the 
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Figure  6  The  Near  Electric  Field  About  a 
Monopole  on  a  Hand-held  Radio 


radiated  power  is  absorbed  by  the  grasping  hand. 


The  shape  of  the  electric  field  contours  in  Figure  6  implies  that  radiation  pattern  directivity  is 
downwards  towards  the  ground.  Thus,  the  ground  will  absorb  power,  an  undesirable  characteristic  for 
the  radiation  pattern.  It  is  desirable  to  reduce  these  percentages  by  finding  an  antenna  configuration 
that  does  not  produce  peak  electromagnetic 
fields  within  the  white  oval  outline  and 
minimizes  currents  on  the  hand-held’s  shell. 

For  a  half-wave  center  fed  dipole  antenna. 

Figure  7,  the  electric  field  peaks  at  the  dipole 
ends,  although  these  peaks  exceed  the  MPE 
standard.  However,  isolation  of  the  antenna 
from  the  hand-held  shell  has  significantly 
reduced  the  size  of  the  exceeded  area  at  the 
shell’s  base.  This  antenna’s  detachment  has 
also  reduced  the  electric  field  along  the  hand¬ 
held  shell  to  15  V/m.  This  can  allow  the  hand 
the  grasp  to  radio  with  less  power  absorption. 

In  this  planar  cut,  only  5  sq.  cm  are  within  the 
projected  head  area,  appreciably  less  than  the 
32  sq.  cm  from  the  monopole.  The  shape  of 
the  electric  field  contours  implies  that 
directivity  is  slightly  upwards,  a  desirable 
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characteristic  for  a  broad-beamed  radiation 
pattern. 

The  inverted-F  antenna  is  a  monopole  bent  at 
90°  with  an  inductive  shorting  stub  to  balance 
the  increase  capacitance.  Figure  8  displays  the 
antenna  configuration  with  the  near  electric 
field  contours.  There  are  two  large  areas  that 
exceed  the  MPE  standard.  The  driven  element 
is  now  on  the  head  side  of  the  hand-held  shell. 

This  is  realized  by  a  large  area  within  the 
projected  head  space,  28  sq.  cm,  within  which 
the  MPE  standard  is  exceeded.  The  lowest 
level  of  electric  field  generated  from  the  hand¬ 
held  shell  is  50  V/m,  the  same  as  the 
monopole.  Grasping  this  hand-held  will 
significantly  reduce  the  EIRP,  as  discussed 
earlier.  The  electric  field  contour  shape 
implies  that  the  radiation  pattern  is  upwards 
towards  zenith.  These  results  indicated  that 
the  inverted-F  antenna  is  not  suitable  for  this  application,  at  least  not  in  the  analyzed  configuration. 

The  last  antenna  analyzed  is  a  half-loop  antenna,  the  antenna  is  driven  at  the  top  attachment  and 
shorted  to  the  bottom  of  the  hand-held.  Figure 
9,  displays  one  large  circular  area  that  exceeds 
the  MPE  standard.  The  half-loop  is  attached 
to  the  hand-held’s  shell,  therefore  it 
incorporates  the  hand-held  conductive  shell  as 
part  of  a  loop.  Current  flows  through  both 
sections  and  radiates  as  a  full  loop. 

There  is  a  large  area  that  exceeds  the  MPE 
standard  within  the  projected  head  space,  36 
sq.  cm.  The  exceeded  area  is  centralized  in  the 
projected  head  space,  therefore  the  SAR  region 
is  around  the  cheek  and  eyes,  but  not  gray 
matter.  Grasping  the  shell  with  a  hand  places 
the  hand  directly  at  the  peak  electric  field 
value.  The  electric  field  contour  shapes  imply 
that  the  radiated  power  is  toward  the  horizon. 

In  summary,  this  version  of  a  half-loop  has  the 
largest  area  of  exceeded  electric  field  within 
the  projected  head  space  and  has  the  highest 
exposure  to  the  hand. 


Figure  9  The  Near  Electric  Field  About  a 
Half-Loop  on  tbe  Hand-held 


y-axis 

Figure  8  The  Near  Electric  Field  About  an 
Inverted-F  on  the  Hand-held 
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V.  Conclusions 


A  number  of  factors  are  used  to  select  antenna  types  and  mounting  configurations  for  any 
communication  radio.  This  paper  focused  on  minimizing  electromagnetic  exposure  of  the  user.  To 
perform  a  numerical  antenna  type  comparison,  a  wire-grid  model  of  a  hand-held  cellular  telephone  was 
developed  and  validated  with  extensive  measurements.  Four  candidate  antenna  types  were  combined 
with  the  validated  wire-grid  model  and  examined  to  determine  potential  radiation  intensities  in  the 
projected  location  of  the  head  and  grasping  hand. 

The  goal  was  to  select  an  antenna  type  and  mounting  configuration  that  minimizes  the  electromagnetic 
exposure  of  the  user.  Two  key  points  were  found,  first,  minimization  of  the  RF  currents  on  the  shell 
reduces  power  absorbed  in  the  grasping  hand  as  well  as  undesirable  perturbation  of  the  antenna 
radiation  patterns.  Second,  understanding  that  near  field  radiation  intensity  occurs  at  the  antenna  ends 
and  not  at  the  antenna  feed  emphasizes  that  antenna  position  is  critical  to  minimizing  near-field 
intensity  within  the  projected  head  space. 

The  data  indicates  that  all  the  antennas  have  localized  areas  where  the  IEEE  MPE  standard  for  total 
body  immersion  for  both  controlled  and  uncontrolled  use  is  exceeded.  Of  the  four  candidate  antennas, 
the  half-wave  dipole  provided  the  lowest  electric  field  intensity  within  the  projected  head  space.  It 
also  had  the  minimum  current  on  the  hand-held  shell,  resulting  in  less  absorption  by  the  hand. 
Therefore,  maximum  useable  power  radiated  and  minimum  electromagnetic  exposure  from  the  hand¬ 
held  occurs  with  a  half-wave  dipole  due  to  minimum  power  absorption  by  the  head  and  hand. 

VI.  References 


1  G.  J.  Burke,  “Numerical  Electromagnetics  Code  -  NEC-4,  Method  of  Moments,”  Part  I:  User’s  Manual,  Lawrence 
Livermore  National  Laboratory,  Jan.  1992. 

2  M.A.  Stuchly,  R  J.  Spiegel,  S.S.  Stuchly,  and  A.  Kraszewski,  “Exposure  of  Man  in  the  Near-Field  of  a  Resonant  Dipole: 
Comparison  Between  Theory  and  Measurements,”  IEEE  Trans  on  MTT,  vol.  MTT-34,  no.  1,  January  1986. 

3  M.  Okoniewski,  M.A.  Stuchly,  “A  study  of  the  Handset  Antenna  and  the  Human  Body  Interaction,”  IEEE  Trans  on 
MTT,  vol.  MTT-44,  no.  10,  October  1996. 

*  V.  Hombach,  K.  Meier,  M.  Burkhardt,  E.  Kuhn,  and  N.  Kuster,  “The  dependence  of  EM  Energy  Absorption  Upon 
Human  Head  Modeling  at  900  MHz,”  IEEE  Trans  on  MTT,  vol.  MTT-44,  no.  10,  October  1996. 

5  tf.p?.  Standard  for  Safety  Levels  with  Respect  to  Human  Exposure  to  Radio  Frequency  Electromagnetic  Fields,  3  kHz  to 
300  GHz,  IEEE  C95.1-199I. 

6  RF.  Harrington,  Field  Computation  by  Moment  Methods,  Krieger  Publishing  Company,  1968. 

7  M.J.  Packer,  “Graphical  Shell  for  Numerical  Electromagnetic  Code,”  Applied  Computational  Electromagnetic  Society 
Symposium,  March  1993. 

*  Holaday  Industries,  Inc.,  HI-4433  Series  Broadband  Isotropic  Field  Probes. 

9  O.P.  Gandhi,  G.Lazzi,  C.F.Furse,  “Electromagnetic  Absorption  in  the  Human  Head  and  Neck  for  Mobile  Telephones  at 
835  and  1900  MHz,”  IEEE  Trans  on  MTT,  vol.  MTT-44,  no.  10,  October  1996. 


696 


Application  of  Multigrid  Method  for  Quick  Simulation  of  SAR  and 
Temperature  Distribution  During  Hyperthermia  Treatment  of  Cervical 

Malignancies. 

Michai  P.  D§bicki*  Michal  Mrozowski*and  Piotr  D§bicki* 


Abstract 

The  paper  presents  the  application  of  the  adaptive  multigrid  technique  to  the  solution  of  SAR  and  tem¬ 
perature  distribution  in  a  patient  undergoing  hyperthermia  treatment.  A  modified  brachytherapy  applicator 
is  used  as  a  hyperthermia  source  in  order  to  use  a  single  device  for  both  treatments.  Voltage  of  frequency 
500  kHz  is  applied  to  the  applicator’s  electrodes  and  thus  quasi-static  approximation  is  possible.  The  E-field 
distributions  for  basic  excitation  modes  are  obtained  by  a  solution  of  Laplace  equation  using  adaptive  multilevel 
finite  element  code.  Then  the  solution  for  an  arbitrary  excitation  is  obtained  by  summation  of  basic  modes  with 
adjusted  amplitudes  and  phases.  The  temperature  distribution  is  obtained  using  the  same  multigrid  approach. 
The  short  evaluation  time  of  the  modeling  enables  the  on-line  simulation  during  treatment. 


1  Introduction 

In  recent  years  multigrid  techniques  have  emerged  as  an  efficient  tool  for  the  solution  of  linear  equation  system 
resulting  from  some  discretization  scheme  such  as  finite  element  (FE)  or  finite  difference  (FD)  methods  [1, 2]. 
Their  effectiveness  can  be  even  further  improved  by  application  of  adaptive  grid  refinement  of  the  computational 
domain  [3].  These  methods  are  especially  valuable  for  the  high  resolution  meshes,  when  the  standard  iterative 
schemes  result  in  long  solution  times  due  to  poor  convergence.  In  this  paper  we  present  an  application  of  the 
adaptive  multigrid  approach  to  the  solution  of  quasi-static  boundary  value  problem  and  then  re-apply  obtained 
results  in  order  to  obtain  the  temperature  distribution.  Our  research  stems  from  the  need  for  fast  treatment 
planning  in  the  hyperthermia  of  cervical  malignancies. 

The  aim  of  a  hyperthermia  treatment  is  to  raise  the  temperature  of  a  treated  region  to  values  from  the  range  of 
43°C  to  48° C  in  order  to  slowly  destroy  tumor  cells.  The  hyperthermia  is,  in  most  cases,  performed  synergetically 
with  radiotherapy  as  a  means  to  enhance  the  response  of  cells  and  to  increase  the  possibility  of  an  effective 
treatment  [4,5]-  The  method,  most  commonly  used  for  the  treatment  of  cervical  malignancies,  is  brachytherapy 
(BT).  This  is  a  kind  of  radiation  therapy  in  which  radioactive  materials  are  placed  in  direct  contact  with  the 
tissue  being  treated.  This  type  of  therapy  is  applicable  for  those  tumours,  which  might  be  directly  accessed  with 
interstitial  or  intracavitary  brachytherapy  applicators.  The  BT  for  the  cervical  malignancies  is  usually  carried  out 
by  means  of  a  commonly  used  and  commercially  available  cervical  Fletcher-Suit  (FS)  applicator  for  afterloading 
devices  (Fig.  1).  This  applicator  consists  of  three  subunits  designed  to  accept  radioactive  Cs-137  pellets.  When 
the  applicator  is  assembled  and  placed  in  the  desired  treatment  position  the  Cs-137  sources  are  moved  into  their 
planned  place  according  to  the  treatment  plan.  The  source  placement  and  their  removal  is  accomplished  with  the 
use  of  compressed  air.  The  irradiation  lasts  for  about  10  hours. 

The  purpose  of  this  study  was  to  adapt  the  brachytherapy  applicator  as  a  hyperthermia  device  (in  order  to 
perform  both  HT  and  BT  treatments  imediately  one  after  another  and  to  reduce  the  trauma  associated  with 
positioning  the  applicator  within  a  patient’s  body)  and  to  investigate  the  possibility  of  an  effective  treatment  by 
software  simulations,  which  then  can  be  used  as  a  core  of  a  hyperthermia  treatment  planning  system.  The  idea  is 
to  complete  the  modeling  and  treatment  planning  while  the  patient  is  being  subjected  to  BT.  Such  an  approach 
requires: 

•Technical  University  of  Gdansk,  Department  of  Electronics,  Telecommunications  and  Informatics, 
ul.  Narutowicza  11/12,  80-952,  Gdansk,  tel.  (+  48  58)  347  2324,  fax.  (+  48  58)  347  12  28,  e-mail:  patryk@eti.pg.gda.pl 
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Colpostats 


Electrical  insulation 


Figure  1:  Modified  selectron  compatible  Fletcher-Suit  cervical  applicator.  The  colpostats  are  both  2  cm  in  diameter 
and  3  cm  in  length.  The  tandem  is  0.6  cm  in  diameter  and  curved  by  30  degrees.  The  tubes  supporting  the  colpostats 
and  the  proximal  part  of  the  tandem  are  covered  by  insulating  paint  (white)  in  order  to  prevent  health  tissues  from 
undesired  heating.  Each  of  the  subunits  are  electrically  insulated.  The  relative  position  of  the  colpostats  and  tandem 
depends  on  a  patient.  A  thin  metal  foil,  working  as  a  ground  electrode  is  additionally  placed  around  a  patient 


5Q4 

Figure  2:  The  description  of  the  domain.  Boundary  6fl0  represent  a  thin  metal  foil  around  a  patient’s  lower 
abdomen.  Boundaries  6Q i  to  Stt3  represent  applicator’s  electrodes  where  voltage  is  applied  and  SQ4  represent 
Neumann  boundary  conditions 


•  modification  of  the  FS  applicator  to  work  as  a  hyperthermia  applicator, 

•  a  fast  solver  to  carry  out  computations  for  each  patient  based  on  the  X-ray  pictures  of  a  FS  applicator 
positioned  inside  patient  body. 

Some  minor  changes,  which  enabled  the  use  of  the  FS  applicator  as  hyperthermia  source  are  presented' in  Fig.  1. 
These  include:  (a)  an  electrical  separation  of  the  three  subunits  in  order  to  function  as  three  separate  electrodes 
(b)  an  electrical  insulation  of  these  parts  of  the  applicator,  where  heating  is  not  desirable. 


2  Problem  Definition 

The  temperature  increase  during  hyperthermia  system  is  accomplished  by  dissipating  electric  current  flovdng 
between  the  modified  Fletcher-Suit  applicator  placed  in  the  treatment  position  aad  a  large  ^ernal  dectro 
surrounding  the  patient’s  lower  abdomen.  Schematic  configuration  is  presented  in  Fig.  2.  The  500  kHz  voltage  is 
applied  to  three  of  the  applicator’s  electrodes  (boundaries  ^  to  W,  while  external  electrode  is  always  grounded 
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(boundary  <5ft0)-  Since  low  frequency  field  is  used  we  may  apply  the  quasi-static  approach  and  find  the  electric 
field  inside  the  tissue  by  solving  the  following  boundary  value  problem: 


{V  •  (crVV)  =  0  infi 
V  =  0  on  sn0 

V-Vn  onSQn,  n  =  1,2,3 

Dr  =  o  °n  <5^4 


where  a  represents  the  conductivity  and  V  the  electric  potential.  Solving  (1)  for  V  we  obtain  the  electric  field 
distribution,  from  the  relation: 

E  =  — VV  (2) 

Power  absorbed  by  the  tissues  is  proportional  to  the  tissue  conductivity  and  a  square  of  the  electric  field  amplitude: 


(3) 


This  quantity  further  divided  by  tissue  mass  density  p  represents  the  specific  absorption  rate  (SAR): 


(4) 


which  expresses  the  amount  of  power  absorbed  by  mass  volume  of  tissue  and  thus  is  the  main  cause  of  temperature 
elevation. 

The  applicator’s  construction  (see  Figs  1  and  2)  implies  that  three  independent  modes  of  excitation  exist  -  001 
(left  electrode  is  excited,  while  others  are  grounded),  010  (central  electrode  is  excited,  while  others  are  grounded) 
and  100  (right  electrode  is  excited,  while  others  are  grounded).  The  outer  electrode  is  always  grounded.  Knowing 
the  solution  for  three  basic  modes  it  is  possible  to  find  E  -  field  distribution  for  arbitrary  excitation  using  a 
superposition  of  the  three  basic  mode  solutions 


3 

Etot  = 


(5) 


where  At  is  is  the  amplitude  and  0*  is  the  phase-shift  of  the  f-th  applicator’s  electrode. 

In  order  to  obtain  the  temperature  distribution,  the  Pennes  bio-heat  transfer  equation  [6]  for  a  stationary  case 
has  to  be  solved: 

-V-{kVT)  +  cbW(T-T„)  =  P  (6) 

where  fc  is  a  thermal  conductivity,  ct  is  tissue  specific  heat,  TV  is  a  volumetric  perfusion  rate  in  [kg  m-3  a-1],  Tb  is 
blood  temperature  and  P  is  the  power  density  obtained  from  (3). 

Boundary  conditions  (BC)  for  the  modeling  of  temperature  distribution  are  specified  for  the  domain  shown 
in  Fig.  2.  Assuming  the  constant  basic  body  temperature  the  boundary  <5£70  is  specified  as  Dirichlet  BC  with 
T  —  37 °C.  The  remaining  boundaries  are  assumed  to  have  the  temperature  of  adjacent  tissue  and  thus  the 
boundary  condition  is  set  as  the  Neumann  type  (material,  the  applicator  is  made  of,  is  characterized  by  small 
thermal  conductivity).  For  the  case  of  air  cooling  of  the  electrodes,  another  boundary  condition  of  the  Cauchy 
type  have  to  be  specified  on  cooled  surfaces  (£fii  to  8Q.3): 

k^  =  -a(T-Tair)  (7) 

where  a  is  a  heat  transfer  coefficient  and  Taif  is  a  temperature  of  cooling  air. 


3  Adaptive  Multigrid  Approach 

As  the  computation  domain  is  large  comparing  to  applicator  dimensions  and  the  configuration  is  complex  (see 
Fig.  1),  it  was  essential  to  use  a  fast  numerical  technique  to  solve  (1)  and  then  (6).  Note  that  the  simulation 
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D-R1  -  grid  refinement 
RI-D-R2-V -Cycle 
•  -  iteration(s)  (one  or  two) 
O  -  direct  solution 


Figure  3:  Schematic  representation  of  Full  multigrid  V-cycle. 


starts  after  the  X-ray  pictures  are  taken  and  has  to  be  completed  before  BT  treatment  is  finished.  To  this  end, 
a  multigrid  method  (MG)  based  on  finite  elements  (FE)  method  was  employed.  The  FE  discretization  scheme 
enables  the  use  of  irregular  tetrahedral  mesh,  which  may  be  well  fitted  to  conform  to  the  complicated  shape  of  the 
electrodes.  After  the  FE  discretization,  we  obtain  a  set  of  linear  equations,  which  may  be  written  m  a  matrix  form 


A u  =  / 


(8) 


where  A  is  a  sparse  and  banded  matrix,  /  is  a  vector  representing  voltage  on  the  electrodes  and  u  is  the  vector  of 

Thlequatlon  (8)  is  usually  solved  in  standard  approaches  by  some  iterative  schemes.  Unfortunately  these 
methods  may  suffer  from  poor  convergence,  especially  for  high  resolution  meshes.  . 

These  problems  are  obviated  in  multigrid  methods  [2,3],  where  the  initial  guess  for  the  first  iteration  step 
is  found  by  a  direct  solution  of  (8)  on  a  very  coarse  grid.  Then  the  mesh  is  alternately  refined  and  restricted 
(coarsened)  after  just  one  or  two  iteration(s)  on  every  grid  size.  The  restriction  is  performed  in  order  to  eliminate 
slowly  converging  components  of  the  solution  (smooth  error  modes  which  axe  difficult  to  eliminate  become  more 
oscillatory  on  a  coarser  grid  and  thus  are  more  successfully  eliminated).  Every  time  the  result  obtained  on  a  coarser 
grid  serves  as  an  initial  guess  for  a  finer  grid.  As  a  result  the  convergence  is  much  faster  than  in  a  regular  relaxation 
scheme  The  solution  of  the  problem  involving  N  unknowns  is  found  at  a  cost  of  0{N)  operations  [2]  for  a  typical 
multigrid  algorithm,  called  the  full  multigrid  V-Cycle  (FMV)  (Fig.  3).  If  used  with  adaptive  grid  refinement 
strategy,  the  algorithm  has  an  additional  useful  feature.  The  adaptive  techniques  are  the  methods,  which  perform 
a  local  mesh  adaptation  according  to  the  error  estimator,  which  indicates  where  the  domain  refinement  would 
be  computationally  most  profitable.  It  may  be  especially  effective  for  the  FMV  cycle,  as  the  algorithm  starts  on 
a  coarse  grid,  which  can  then  be  effectively  refined  using  an  adaptive  technique.  Such  an  approach  reduces  the 
memory  requirements  and  shortens  the  evaluation  time.  This  is  explained  by  the  fact  that  the  adaptively  refined 
grid  has  much  fewer  nodes  for  the  same  level  of  the  solution  error  then  the  one  refined  uniformly. 

4  Implementation 

The  KASKADE  system  [7, 8]  was  used  as  a  core  for  the  computer  implementation  of  the  adaptive  multigrid 
scheme.  The  system  accepts  the  initial  triangulation  of  the  problem  together  with  its  BC  as  an  input.  The 
automatic  triangulation  is  performed  using  the  QMG  [9]  mesh  generator  with  the  input  data  obtained  from  the 
two  X-ray  pictures  of  the  patient  with  the  positioned  FS  applicator.  Then  the  finite  element  discretization  is 
performed  and  the  resulting  set  of  equations  can  be  solved  using  one  of  the  implemented  methods.  These  include 
a  direct  ^-decomposition  of  the  system  matrix  A  (Harwell-MA28  sparse  matrix  solver)  or  iterative  procedures. 
Relaxations  routines  operate  on  a  residual  equation  of  the  original  system  and  use  different  kind  of  preconditioning 
in  order  to  improve  the  convergence  of  the  system.  More  specifically,  it  is  possible  to  choose  a  single  or  multilevel 
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Table  1:  Comparison  of  total  number  of  iterations,  solution  time  and  memory  allocation  for  adaptive  finite  elements 
and  multigrid  schemes. 


Total  problem  size:  105445  (initial  3962) 

Memory 

[MB] 

Number  of 
iterations 

Total 
time  [s] 

Nested  iteration  -  no  preconditioning 

222.6 

772 

74.45 

Nested  iteration  -  single  level  preconditioning 

222.6 

111 

19.05 

Full  multigrid  (multilevel  preconditioning) 

241.1 

56 

15.94 

Number  of  iterations  per  problem  size 


Problem  size  N 

Figure  4:  Number  of  iterations  (top)  and  solution  times  (bottom)  per  problem  size  (for  selected  refinement  levels). 
Each  refinement  level  uses  the  solution  of  its  preceding  level  as  an  initial  guess  and  is  referred  to  as  a  nested 
iteration  scheme.  It  can  be  seen,  that  the  number  of  iterations  significantly  increases  if  no  preconditioning  is 
applied  (except  for  the  first  level  with  arbitrary  initial  guess).  The  use  of  a  single  level  preconditioning  significantly 
improves  the  situation  resulting  in  almost  constant  number  of  iterations  per  problem  size.  The  application  of  a 
multigrid  preconditioner  results  in  the  decrease  of  the  number  of  iterations,  for  the  subsequent  refinements.  The 
solution  time  per  one  iteration  is  longer  for  multigrid  algorithm,  but  one  needs  to  keep  in  mind  that,  unlike  in  other 
methods,  the  number  of  iterations  decreases  with  each  refinement  step. 


preconditioners.  The  former  ones  are  a  typical  routines  of  Jacobi,  SSOR  or  ILU  preconditioning,  which  do  not 
change  the  resolution  of  the  system  matrix.  The  letter  ones  compute  the  coarse  grid  matrices  via  a  Galerkin 
procedure  or  by  simply  storing  the  system  matrices  for  selected  refinement  steps.  Since  adaptive  mechanism 
gives  different  final  meshes  for  each  of  3  excitations,  additional  mesh  processing  is  needed  in  order  to  use  the 
superposition  principle  (5).  Thus  additional  procedures  were  added  to  the  KASKADE  system.  These  procedures 
perform  grid  interpolation,  which  transforms  the  three  basic  solutions  to  a  single  grid.  Once  the  solutions  for  12- 
field  are  summed  up  with  prescribed  amplitudes  and  phases,  further  calculations  are  performed  giving  the  power 
(3)  or  SAR  (4)  distribution.  Such  an  approach  gives  the  flexibility  in  obtaining  an  arbitrary  solution  for  different 
amplitudes  and  phases  of  the  excitation  and  also  for  switching  between  two  distinct  modes  (i.e.  the  solution  when 
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Table  2:  Overall  solution  time.  CPU:  R10000,  195MHz. 


!  Grid  generation 

315  s. 

(5:15  min.) 

Grid  refinement 

88.6  s. 

E  -  field 

Solution 

12.2  s. 

distribution 
for  a  basic 
mode 

Error  estimation 

Mesh  interpolation 

100.8  s. 

57.3  s. 

Total 

314.7  s. 

(5:15  min.) 

Total  (3  basic  modes) 

944.1  s. 

(15:44  min.) 

|  Field  summation  and  calculation  of  SAR 

0.2  s. 

Grid  refinement 

6.3  s. 

Temperature 

Solution 

5.2  s. 

distribution 

Error  estimation 

45.5  s. 

Total 

70.6  s. 

(1:11  min.) 

Total 

1329.9  s. 

(22:01  min.) 

two  different  modes  are  interlaced  with  some  time  interval).  Obtained  results  for  the  power  distribution  axe  then 
again  interpolated  onto  initial  grid  of  the  thermal  problem  solution  (6),  which  is  again  solved  using  the  KASKADE 
system. 


5  Numerical  tests 

The  results  of  selected  numerical  tests  are  presented  in  Table  1  and  in  Figure  4.  It  can  be  mferred  that,  comparing 
to  nested  iteration  scheme,  the  full  multigrid  algorithm  significantly  improves  the  convergence  for  the  problem  at 
hand  with  the  decreasing  number  of  iterations  with  each  refinement  step  at  the  cost  of  some  additional  memory. 
Nevertheless,  the  solution  time  is  only  improved  for  the  last  refinement  step  for  the  largest  problem  size,  as  multigrid 
preconditioning  has  higher  numerical  cost  than  the  single-level  one.  This  however  is  profitable  for  large  number  of 
refinement  steps,  when  the  number  of  iteration  for  multigrid  scheme  is  much  lower. 


6  Results 


Times  required  for  modeling  the  SAR  and  temperature  distribution  are  presented  in  Table  2.  The  modeling  was 
performed  on  SGI  Onyx  2  computer  with  R10000  CPU,  195  MHz  and  2.5  GB  of  RAM.  Initial  grid  was  19120 
nodes  and  86528  tetrahedrals.  The  author’s  intention  is  to  use  a  different  grid  generator  in  the  future  m  order 
to  obtain  coarser  initial  grid  and  thus  to  improve  the  effectiveness  of  multigrid  scheme.  The  final  grid  resulted  in 
126211  points  and  704221  tetrahedrals.  Then  the  mesh  was  interpolated  on  a  regular  grid  with  336336  nodes. 

The  total  modeling  procedure  takes  22  minutes.  The  most  time  is  spent  in  a  solution  of  basic  modes  and 
grid  generation  Nevertheless,  those  are  calculated  only  once  and  the  adjustment  of  amplitudes  and  phases  are 
found  in  less  than  a  second.  Thus,  the  most  important  treatment  parameters  can  be  easily  adjusted  in  order  to 
obtain  desired  pattern  of  power  deposition.  Once  the  excitations  have  been  found,  the  simulation  of  temperature 


distribution  can  be  performed  within  a  minute.  ...  ,  ,  ...  _ 

Using  the  AVS*  system  for  the  presentation  of  the  results,  different  visualization  methods  are  possible,  figure  o 
presents  ISO-level  surfaces  of  the  SAR  distribution,  while  Figure  6  shows  examples  of  temperature  distribution 


'Advanced  Visual  Systems  Inc. 
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3)1^1=65,0,70  b)  |V|  —  85,85,85  c)  0.35  •  o)  +  0.65  -  b) 

$  =  0,0, 180  $  =  0,0,0 

Figure  6:  Example  of  solutions  obtained  for  different  amplitudes  and  phases  applied  to  the  electrodes  of  the  appli¬ 
cator.  Case  c)  is  obtained  by  switching  between  mode  a)  and  b)  with  proportion  of  35  to  65  percent. 
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for  selected  cross-sections  of  the  domain  and  for  different  excitation  modes. 

7  Conclusions 

A  fast  numerical  method  was  applied  to  determination  of  SAR  and  temperature  distribution.  This  approach 
enables  the  on-line  simulations  during  hyperthermia  treatment. 
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